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(57)Abstract: 



PROBLEM TO BE SOLVED: To provide a motion estimate method, with motion 
precision having adaptability that includes at least a technology for a motion vector 
with high pixel precision and less increase in the calculation quantity, 
SOLUTION: One technology employs a fast search strategy in a partial pixel space, to 
smartly retrieve an optimum motion vector. Other technology uses a different 
interpolation filter in a different stage, to estimate a motion vector with high precision, 
so as to reduce the complexity of calculation. Still another technology employs rate 
distortion criterion applied according to a different motion precision, to decide both an 
optimum motion vector and the optimum motion precision. Furthermore, yet another 
technology uses a VLC table interpreted differently in a different coding unit, according 
to the related motion vector accuracy. 
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CLAIMS 



[Claim(s)] 

[Claim 1] By finding out the optimal motion vector to one macro block It is the search 
approach with motion precision with the fast search adaptability for presuming the 
motion vector in motion compensation animation coding (a). In order to find out the 
optimal motion vector V2 Step which searchs the 1st set of the motion vector 
candidate of centering on VI square within the limits defined beforehand on a partial 
pixel resolution grid (b) In order to find out the optimal motion vector V3 Step which 
searchs the 2nd set of the motion vector candidate of centering on V2 square within 
the limits defined beforehand on a partial pixel resolution grid (c) In order to find out 
said optimal motion vector of said macro block The search approach with motion 
precision with the fast search adaptability for presuming the motion vector 
characterized by changing including the step which searchs the 3rd set of the motion 
vector candidate of centering on V3 square within the limits defined beforehand on a 
partial pixel resolution grid. 

[Claim 2] In order to find out the optimal motion vector V2. said step which searchs 
the set [ 1st ] motion vector candidate of centering on VI square within the limits 
defined beforehand on a partial pixel resolution grid In order to find out the optimal 
motion vector V2 The search approach with motion precision with the fast search 
adaptability for presuming the motion vector according to claim 1 characterized by 
including further the step which searchs the 1 st set of eight motion vector candidates 
in the square range 1 centering on VI on a 1/2-pixel resolution grid. 
[Claim 3] In order to find out the optimal motion vector V3, said step which searchs 
the set [ 2nd ] motion vector candidate of centering on V2 square within the limits 
defined beforehand on a partial pixel resolution grid In order to find out the optimal 
motion vector V3 The search approach with motion precision with the fast search 
adaptability for presuming the motion vector according to claim 1 characterized by 



including further the step which searchs the 2nd set of eight motion vector candidates 
in the square range 1 centering on V2 on a 1/6-pixel resolution grid. 
[Claim 4] The search approach with motion precision with the fast search adaptability 
for presuming the motion vector according to claim 1 characterized by including 
further the step which uses V2 as a motion vector to said block when V2 has the 
minimum rate distortion cost, and the step which skips the step (c) of claim 1. 
[Claim 5] In order to find out said optimal motion vector of said macro block Said step 
which searchs the set [ 3rd ] motion vector candidate of centering on V3 square 
within the limits defined beforehand on a partial pixel resolution grid In order to find 
out said optimal motion vector of said macro block The search approach with motion 
precision with the fast search adaptability for presuming the motion vector according 
to claim 1 characterized by including further the step which searchs the 3rd set of 
eight motion vector candidates in the square range 1 centering on V3 on a 1/6-pixel 
resolution grid. 

[Claim 6] In order to find out said optimal motion vector of said macro block Said step 
which searchs the set [ 3rd ] motion vector candidate of centering on V3 square 
within the limits defined beforehand on a partial pixel resolution grid The search 
approach with motion precision with the fast search adaptability for presuming the 
motion vector according to claim 1 characterized by including further the step which 
skips the candidate [ finishing / test / already ] in said set [ 3rd ] motion vector 
candidate. 

[Claim 7] Said step which searchs said set [ 1st ] motion vector candidate The step 
which searchs said set [ 1st ] motion vector candidate using the 1st filter in order to 
perform 1st interpolation is included further. Said step which searchs said set [ 2nd ] 
motion vector candidate The step which searchs said set [ 2nd ] motion vector 
candidate using the 2nd filter in order to perform 2nd interpolation is included further. 
Said step which searchs said set [ 3rd ] motion vector candidate The search approach 
with motion precision with the fast search adaptability for presuming the motion 
vector according to claim 1 characterized by including further the step which searchs 
said set [ 3rd ] motion vector candidate using the 3rd filter in order to perform 3rd 
interpolation. 

[Claim 8] In order to find out the optimal motion vector V3, said step which searchs 
the set [ 2nd ] motion vector candidate of centering on V2 square within the limits 
defined beforehand on a partial pixel resolution grid (a) The step which searchs 1 / 2 
pel location which has the minimum RD cost next with three candidates V2 of 1 / 3 
pel precision when there is V2 at the core, (b) When V2 is the vector of a corner Step 



which searchs 4 vector candidates of 1 / 3 pel precision nearest to V2 (c) When V2 is 
between two corner vectors A corner with lower RD cost in two corners is determined. 
The search approach with motion precision with the fast search adaptability for 
presuming the motion vector according to claim 1 characterized by including further 
the step which searchs 4 vector candidates of 1 / 3 pel precision nearest to Rhine 
between said corners with lower RD cost. 

[Claim 9] By finding out the optimal motion vector to one macro block It is the search 
approach with an adaptable motion precision for presuming the motion vector in 
motion compensation animation coding (a). In order to find out the optimal motion 
vector V2 using the 1st filter for performing the 1st interpolation Step which searchs 
the set [ 1st ] motion vector candidate in the grid centering on VI (b) In order to find 
out the optimal motion vector V3 using the 2nd filter for performing the 2nd 
interpolation Step which searchs the set [ 2nd ] motion vector candidate in the grid 
centering on V2 (c) In order to find out said optimal motion vector of said macro block 
using the 3rd filter for performing the 3rd interpolation The search approach with an 
adaptable motion precision for presuming the motion vector characterized by turning 
to V3 including the step which searchs the set [ 3rd ] motion vector candidate in the 
grid which has a core. 

[Claim 10] Said step searched using the 1st filter for performing the 1st interpolation 
is the search approach with an adaptable motion precision for presuming the motion 
vector according to claim 9 characterized by including further use of the easy filter for 
performing coarse interpolation. 

[Claim 1 1] Said step which searchs using the 2nd filter for performing the 2nd 
interpolation, including further use of an easy filter for said step searched using the 
1st filter for performing the 1st interpolation to perform coarse interpolation is the 
search approach with an adaptable motion precision for presuming the motion vector 
according to claim 9 characterized by to include further use of the composite filter for 
performing fine interpolation. 

[Claim 12] Said step searched using the 3rd filter for performing the 3rd interpolation 
is the search approach with an adaptable motion precision for presuming the motion 
vector according to claim 1 1 characterized by including further use of the composite 
filter for performing fine interpolation. 

[Claim 13] Said step searched using the 1st filter for performing the 1st interpolation 
is the search approach with an adaptable motion precision for presuming the motion 
vector according to claim 9 characterized by including further use of the congruence 
linearity filter for interpolating the reference frame of 2x2. 



[Claim 14] Said step searched using the 1st filter for performing the 1st interpolation 
Said step searched using the 2nd filter for performing the 2nd interpolation, including 
use of a congruence linearity filter further in order to interpolate the reference frame 
of 2x2 The search approach with an adaptable motion precision for presuming the 
motion vector according to claim 9 characterized by including further use of the cube 
filter for performing fine interpolation. 

[Claim 15] Said step searched using the 3rd filter for performing the 3rd interpolation 
is the search approach with an adaptable motion precision for presuming the motion 
vector according to claim 14 characterized by including further use of the cube filter 
for performing fine interpolation. 

[Claim 16] By finding out the optimal motion vector to one macro block In the search 
approach with an adaptable motion precision for presuming the motion vector in 
motion compensation animation coding (a) The step which searchs the 1st [to the 1st 
optimal motion vector of said macro block ] motion precision, (b) Said 1st optimal 
motion vector and step which encodes said 1st motion precision (c) The step which 
searchs 2nd at least one optimal motion vector of said macro block in 2nd at least one 
motion precision, (d) The step which encodes said 2nd at least one optimal motion 
vector and said 2nd at least one motion precision, (e) The search approach with an 
adaptable motion precision for presuming the motion vector characterized by 
changing including said 1st [ the ] and the step which chooses said optimal motion 
vector using a rate distortion criterion from at least one optimal motion vectors. 
[Claim 17] It is the search approach with an adaptable motion precision for presuming 
the motion vector according to claim 16 characterized by said optimal step which 
makes motion vector selection containing further the step to which said rate 
distortion criterion is fitted according to motion precision which is different in order 
[ said ] to determine both said optimal motion vectors and said optimal motion 
precision using a rate distortion criterion. 

[Claim 18] Said step which searchs 2nd at least one optimal motion vector in 2nd at 
least one motion precision is the search approach with an adaptable motion precision 
for presuming the motion vector according to claim 16 characterized by including 
further the step which is 2nd at least one motion precision finer than said 1st motion 
precision, and searchs 2nd at least one optimal motion vector. 

[Claim 19] It is the search approach with an adaptable motion precision for presuming 
the motion vector according to claim 16 characterized by said optimal step which 
makes motion vector selection containing further the step which uses the rate 
distortion criterion of a "distortion +Lx bit" mold in order [ said ] to choose said 



optimal motion vector using a rate distortion criterion. 

[Claim 20] By finding out the optimal motion vector to one macro block It is the 
search approach with an adaptable motion precision for presuming the motion vector 
in motion compensation animation coding (a). The step which searchs one motion 
precision over the one optimal motion vector of said macro block, (b) The code from 
the variable-length sign (VLC) table interpreted by differing in the coding unit which 
changes with related motion vector precision is used. Step which encodes said motion 
precision (c) The search approach with an adaptable motion precision for presuming 
the motion vector characterized by changing including the step which encodes said 
optimal motion vector in each precision space. 

[Claim 21] By finding out one optimal motion vector to one macro block, it is a system 
for presuming the motion vector in motion compensation animation coding (a). In order 
to find out the optimal motion vector V2 The 1 st encoder for searching the set [ 1 st ] 
motion vector candidate of centering on V1 square within the limits defined 
beforehand on a partial pixel resolution grid (b) In order to find out the optimal motion 
vector V3 The 2nd encoder for searching the set [ 2nd ] motion vector candidate of 
centering on V2 square within the limits defined beforehand on a partial pixel 
resolution grid, (c) In order to find out said optimal motion vector of said macro block 
The system for presuming the motion vector characterized by changing including the 
3rd encoder for searching the set [ 3rd ] motion vector candidate of centering on V3 
square within the limits defined beforehand on a partial pixel resolution grid. 
[Claim 22] The system for presuming the motion vector according to claim 21 
characterized by said 1st encoder, the 2nd encoder, and the 3rd encoder being single 
encoders. 



DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] Especially this invention relates to the effective approach of 
presuming the motion vector in motion compensation animation coding, and encoding 
about bit compression or the coding approach of a digital animation. 
[0002] 

[Description of the Prior Art] In the conventional motion presumption approach, the 
present frame to encode is called "a macro block", for example, is divided into an 
image block with a same size of 16x16 pixels. An encoder searchs the frame 
("reference frame") inner block encoded before being most in agreement with the 
present macro block for every ** macro block. The coordinate movement magnitude 
during the macro block which is [ in one present macro block and reference frame ] 
most in agreement is expressed by the two-dimensional vector ("motion vector") of a 
macro block. Each component of a motion vector is measured per pixel. 
[0003] For example, when the same location has the reference macro block which is 
most in agreement with the present macro block, as a quiescence background image 
is a typical example, the motion vectors of the present macro block are (0, 0). Motion 
vectors when a reference macro block most in agreement is discovered on the right 
from the coordinate of the present macro block in the location of 3 pixels at 2 pixels 
and a top are (2, 3). ** — it is said that a motion vector [ like ] has the precision of an 
integer pixel (or a "integer pel" or a "perfect pel") since the horizontal component X 
and the vertical component Y have the integer pixel value. In drawing 1 , vector VI = (1 
1) expresses the motion vector of the perfect pel to the present macro block of 
arbitration. 

[0004] The migration object in an animation scene does not move to a frame per 
integer pixel increment from a frame, carrying out a deer. A true motion takes the real 
number value which met in X and the direction of Y. Therefore, the reference macro 
block which is well in agreement with the present macro block has often discovered by 
searching the macro block which is [ in the frame which interpolated the front frame 
by the factor (factor) of NxN, and then interpolated it ] most in agreement. A motion 
vector can take the delta value of a 1-/N pixel along with X and Y, and if it has the 



precision of a 1-/N pixel (or 1-/N pel), it will be expressed. 

[0005] "The response to the proposal appeal to H.26L specification (Response to Call 
for Proposals for H.26Lr (Q. 15/SG16, document Q15-F-1 1. Seoul November, 1998) 
of an ITU telecommunication standardization section and "strengthening 
(Enhancement of theTelenor Proposal for H.26L) of the proposal of Telenor to H.26L 
specification" (Q. — 15/SG16, document Q15-G-25, and Monterey — ) of an ITU 
telecommunication standardization section It will set in February. 1999 and is Gisle. 
Bjoitegaard proposed the motion vector of 1 / 3 pel precision, and use of cube-like 
interpolation about animation coding specification H.26L ("Telenor encoder"), or 
[ that a cube-like interpolation filter is used for a Telenor encoder, and it interpolates 
a reference frame 3x3 times in order to carry this out ] — or "a rise sampling" is 
carried out. This interpolation version needs 9 times as much memory as a reference 
frame. In one given macro block, a Telenor encoder presumes the optimal motion 
vector at two steps. That is, an encoder searches for the optimal integer pel vector 
first, and searches for vector VI /3 of the optimal 1/3-pixel precision near V1 next. In 
the example of drawing 1 . a total of 8 blocks of (16x16 pixels) of a 3x3 interpolation 
reference frame is investigated, and the block most in agreement that is a block 
relevant to motion vector VI / 3=(VX. VY) = (31+1/1) like illustration is found out. 
This Telenor encoder has some problems, in order [ first, ] to calculate the motion 
vector of 1 / 3 pel precision — a part — it is using the optimal fast search strategy 
and a complicated cube filter (setting on a whole page story). As a result the 
calculated motion vector requires storage capacity and computational complexity that 
it is not the optimal and immense, and attaches them very at an expensive price, 
furthermore, this Telenor encoder chooses a better motion precision, using 
[ therefore ] the precision by the effective rate distortion criterion fixed to 1/3 pixel 
— as — it cannot be adapted. Similarly, since the variable-length sign ("VLC") table 
of this Telenor encoder has the precision fixed to 1/3 pixel, it cannot perform an 
interpretation which cannot be adapted to a different precision and is different. 
[0006] Although a motion vector is presumed and it encodes with 1/2-pixel precision 
by the animation compression approach learned best, the reason is that according to 
old research it had suggested that additional compression gain was not acquired only 
by count becoming complicated in the motion precision which has adaptability more or 
it was more high. However, research old [ these ] did not presume a motion vector 
using the optimization rate distortion criterion, it did not use the convex property of 
criteria [ like ] in order to decrease complication of count, and it did not use the 
effective strategy which encodes the precision of a motion vector and a motion 



vector. 

[0007] ** — as one of the researches old [ like ] — Bernd The paper (it is henceforth 
written as the paper of Girod) "motion compensation prediction 
(Motion-Compensating Prediction with Fractional-Pel Accuracy) by fraction pel 
precision" (the IEEE communication link report, volume [ 41st ] No. 4. 604 - 612 pages, 
April, 1993) of Girod is mentioned. The paper of this Girod is the first fundamental 
analysis about the advantage which uses the motion precision of a partial pixel 
(subpixel) for animation coding. In order to search for the optimal motion vector in 
partial pixel space, the easy hierarchy strategy was used for Girod. in order that he 
may choose the optimal motion vector to a given precision again — an arithmetic 
average — absolute — the criteria of difference ("MAD") were used. Since this 
optimal precision is based on the idealized assumption, it is chosen using an 
impractical formula, and it is very complicated and all motion vectors are restricted to 
a thing with the same precision in one frame. Finally. Girod did not mention the usage 
of the bit for encoding a motion vector only paying attention to prediction error energy. 
[0008] As research old [ other ]. it is Smita. Gupta and Allen The paper (it is 
henceforth written as "the paper of Gupta") "motion presumption (On Fractional 
Pixel Motion Estimation) of a fraction pixel" (a SPIE VOIP report, 2094 pages [ 408 - 
419 ], Cambridge. November. 1993) of Gersho is mentioned. The paper of this Gupta 
has presented the approach of calculating and choosing a motion vector and encoding 
in partial pixel precision, for animation compression, the formula based on an average 
square error ("MSE") and congruence linear interpolation in the paper of Gupta 
indicating — this formula — using — an ideal motion vector — finding out — ** 
the vector [ like ] was quantized for a desired motion precision. This optimal motion 
vector to a given precision was determined using the partial optimal MSE criterion, 
and this optimal precision was chosen using the approach of carrying out the 
maximum reduction of the energy difference per distortion bit. this is the criterion (a 
part — the optimal) for which it craved. Coding of a given motion vector was 
performed by the approach of encoding in 1 / 2 pel precision first, and then encoding 
to high degree of accuracy more in a circumstantiation bit. Coding of rough ** detailed 
** is disagreeable ****** which needs a remarkable bit. 

[0009] The optimal motion vector precision in the case of the motion compensation 
animation encoder based on a paper" block 0 [ On the Optimal Motion Vector 
Accuracy for Block-Based Motion-Compensated ] In Video Coders" (the 
compression report of an IST/SPIE digital animation: an algorithm, a technique, 302 - 
314 pages, San Jose. February. 1996) (it is henceforth written as the paper of Ribas) 



Jordi Ribas-Corbera and David L Neuhoff modeled the effect of the bit rate on 
motion precision, and proposed some methods of presuming the optimal precision 
which makes a bit rate min. The paper of this Ribas described the full search approach 
for calculating the motion vector to arbitration precision, and took only congruence 
linear interpolation into consideration. The optimal motion vector was discovered by 
making MSE into min, and the optimal precision was chosen using some formulas 
drawn by optimization of rate distortion. To real-time equipment, mounting encoded a 
motion vector and precision using the difficult complicated frame adaptation entropy 
encoder. 

[0010] a paper — " — the proposal 0 of the new core experiment about prediction 
strengthening with a high bit rate [ Proposal for a new core experiment on Prediction 
enhancement at higher ] bitrates" (coding ISO/IEC JTC1/SC29/WG1 1 animation and 
voice) The performance evaluation of the mounting equipment which mitigated the 
complexity for MPEG 97/1827, SEBIRIA. February. 1997, and "1 / 4 pel motion 
compensation () [ Performance Evaluatior of a Reduced Complexity ] In Implemetatior 
for Quarter Pel Motion Compensatior" (coding of ISO/IEC JTC1/SC29/WG1 1 
animation and voice, MPEG 97/3146, San Jose, January. 1998) Urich Benzler Use of 
the motion vector of 1 / 4 pel precision and use of the interpolation filter with which 
MPEG4 animation coding criteria progressed further were proposed for the animation 
sequence. However. Benzler used the fast search method of Girod for discovery of 
the motion vector of 1/4 pel. Although Benzler did not take a different interpolation 
filter into consideration, it proposed using a easier filter in the 2nd step in the 1st step 
using a composite filter, and interpolated one macro block at once. Although a lot of 
[ this approach ] cache memory is not needed, it is complicated, and computational 
complexity is great, in order to calculate all motion vectors in all possible mode (for 
example, 16x16, 4-8x8, 16-4x4 grade) per 1 / 4 pel precision within one macro block 
and to determine the optimal mode. Benzler used the MAD criterion and discovered 
the optimal motion vector fixed to 1 / 4 pel precision to all sequences. Therefore, the 
approach of choosing the optimal motion precision was not shown. Finally, Benzler 
encoded the motion vector using the usable variable-length sign ("VLC") table, in 
order to encode the vector of 1/2 and 1/4-pixel precision. 
[0011] 

[Problem(s) to be Solved by the Invention] the rate distortion criteria which optimized 
above-mentioned bibliography — using — a motion vector — not presuming — ** — 
the complexity of count is not mitigated using the convex property of criteria [ like ]. 
Furthermore, the effective strategy which encodes a motion vector and precision is 



not used for such bibliographies. 
[0012] 

[Means for Solving the Problem] One operation gestalt of this invention solves the 
problem of the conventional technique by calculating the motion vector of high pixel 
precision (called a "fraction" or "partial pixel" precision) by the increment in slight 
computational complexity. 

[0013] If the strategy of this invention is used, it is clear by experiment that an 
animation encoder can attain remarkable compression gain (for example, reduction of 
the bit rates which reach to 30% as compared with classic selection of motion 
precision) on equivalent count level. Since it is adapted for motion precision and is 
calculated and chosen as it, this invention can be described as an adaptation motion 
precision ("AMA") method. 

[0014] One of the desirable operation gestalten of this invention searchs the optimal 
motion vector smartly using the fast search strategy in partial pixel (subpixel) space. 
This technique presumes the motion vector at the time of motion compensation 
animation coding by discovering the optimal motion vector to 1 macro block. Within 
the grid of the partial pixel resolution of the square range which has a core in VI and 
which was appointed beforehand, the 1st step searchs a set [ 1st ] motion vector 
candidate, and finds out the optimal motion vector V2. Next, within a grid with the 
partial pixel resolution of the square range which has a core in V2 and which was 
appointed beforehand, it looks for a set [ 2nd ] motion vector candidate, and the 
optimal motion vector V3 is found out. Then, within a grid with the partial pixel 
resolution of the square range which has a core in V3 and which was appointed 
beforehand, it looks for a set [ 3rd ] motion vector candidate, and the optimal motion 
vector of a macro block is found out. 

[0015] In another desirable operation gestalt of this invention, a highly precise motion 
vector presumption technique can reduce the complexity of count using a different 
interpolation filter in a different phase. 

[0016] Another desirable operation gestalt of this invention chooses the optimal 
vector and precision about rate distortion (RD). This operation gestalt determines 
both the optimal motion vector and the optimal motion precision using the rate 
distortion criterion which corresponds according to a different motion precision. 
[0017] variable length coding (VLC) with still more nearly another, effective desirable 
operation gestalt of this invention — a motion vector and precision are encoded by 
law. This technique uses the VLC table according to a related motion vector precision 
in which an interpretation which is different in a different coding unit is possible. 



[0018] **** and other purposes, the description, and advantage of this invention will 
be easily understood, if the following detailed explanation of this invention is read with 
reference to an accompanying drawing. 
[0019] 

[Embodiment of the Invention] Although it moves in each image block, and precision is 
changed and being explained, many approaches of this invention can be applied also 
when changing precision for every immobilization or frame to all sequences. This 
invention is described again as what uses the animation encoder (and specially 
encoder of Telenor) of Telenor, as the background of invention was described. 
Although explained using the vocabulary of the animation encoder of Telenor, the 
technique described here is applicable to other motion compensation animation 
encoders of arbitration. 

[0020] Many of animation encoders use the motion vector and congruence linear 
interpolation of half-pixel (or "1/2 pel") precision. 1 / 2 pel motion vector, and 
congruence linear interpolation are used for the 1st version of the encoder of Telenor. 
However, the latest version of the encoder of Telenor builds in 1 / 3 pel vector, and 
the cube form interpolation function, in order to acquire the further compression gain. 
Especially, in one given macro block, the encoder of Telenor presumes the optimal 
motion vector at two steps shown in drawing 2 . First, this Telenor encoder searches 
for the optimal integer pel vector VI ( drawing 1 ) (step 100). Next, this Telenor 
encoder searchs 1/3 pel precision vector VI /3 with the VI optimal neighborhood 
( drawing 1 ) (step 102). this 2nd step finds out the block which it all (it has the pixel 
train of each 16x16) comes out in a 3x3 interpolation (interpolation) reference frame, 
investigates eight blocks, and is most in agreement, as shown in the graph of drawing 
1 . The motion vector of eight blocks is expressed as eight real points into the grid 
which has a core in VI. In drawing 1 , the block relevant to motion vector VI / 3=(VX, 
VY) = (3 1+1/1) is most in agreement. 

[0021] According to the technique of this invention, an encoder can choose precision 
using either a full search strategy or a fast search strategy between the groups (for 
example, 1/2. 1/3, and 1 / 6 pel precision motion vector) of the motion precision of 
arbitration. 

[0022] (Search strategy of a full search AMA method) As shown in drawing 3 and 
drawing 4 , in the case of this full search adaptation motion precision (AMA) method 
search strategy, a Telenor encoder searchs all the motion vector candidates on five 
the "(it defines as a square block specified with the number of upper part pixels, the 
number of lower part pixels, and the number of pixels of both sides) square range" of a 



pixel and the grids of 1/6-pixel resolution which are shown in drawing 3 . As shown in 
drawing 4 . at the 1st step (104) of the full search AMA. the optimal integer pel vector 
VI ( drawing 1 ) is searched. In the 2nd step (106) of the full search AMA, an encoder 
searchs 1/6 pixel precision vector VI /6 with the VI optimal neighborhood ( drawing 
3 ). If it puts in another way, this full search AMA will change the 2nd step of a Telenor 
process, and the motion vector candidate in the partial pixel location of the others 
[ encoder ] in velocity space will also enable it to search it. This purpose is discovering 
the optimal motion vector in a grid, i.e., the vector which directs the block (inside of a 
interpolation reference frame) which is most in agreement with the present macro 
block. Although count is complicated since this full search strategy searchs 120 
partial pixel candidates, the total potentia of this operation gestalt of this invention is 
shown. 

[0023] The important technical problem in a motion vector search is selection of the 
judgment measure or criteria of deciding the block which coincides with a given macro 
block most, actual — almost all approaches — an average square error (MSE) or an 
average — absolute — which criterion of difference (MAD) is used. MSE during two 
blocks subtracts the pixel value of two blocks, squares a pixel value difference, and 
takes the average. MAD during two blocks — difference — square — except for 
calculating the absolute value of a pixel value difference instead of count, it is the 
same distortion measure. If two image blocks are mutually similar, the value of MSE 
and MAD is small. However, these values are large if an image block is not similar. 
Therefore, a typical animation encoder finds out the macro block which coincides with 
a macro block most by choosing the motion vector which brings about any of the 
minimum MSE or the minimum MAD they are. In other words, the block relevant to the 
optimal motion vector is a block of most approximation in a given macro block in MSE 
or MAD. 

[0024] Though regrettable, the distortion measure of MSE and MAD is not taking into 
consideration the cost of the bit which actually encodes a vector. For example, 
although MSE is made as for one arbitration motion vector to min, the cost encoded in 
a bit may be very high, and may not be the optimal selection from a viewpoint of 
coding. 

[0025] In order to deal with this problem, the newest encoder which Telenor has 
described chooses the optimal motion vector using the rate distortion (RD) criterion 
of a "distortion +Lx bit" form. A "distortion" value is MSE or MAD typically, "L" is a 
constant depending on compression level (namely, quantization step size), and a "bit" 
is the number of bits which coding of a motion vector takes. Generally, this type of 



every RD criterion can be used for this invention. However, in the case of this 
invention, the "bit" contains the bit which coding of a vector takes, and the bit which 
coding of the vector precision takes. In fact, since some candidates have some 
precision modes, it can have the "bit" value of the number of some. For example, the 
candidate in a location (1/2, -1/2) can also consider suddenly 1/2 or 1/6-pixel 
precision. 

[0026] (Fast search AMA method search strategy) As shown in drawing 5 and drawing 
6 , in the case of a fast search adaptation motion precision (AMA) method search 
strategy, an encoder investigates only a motion vector candidate's small set. At the 
1st step (108) of the fast search AMA, as for an encoder, one side checks eight 
motion vector candidates of the square (inside of the square range 1) of 1 with a core 
in the grid of 1 /2-pixel resolution to VI . V2 is set up in order to express the candidate 
(namely, eight last vectors and what is the optimal in VI) who has the minimum RD 
cost (110). Next, an encoder has a core in V2 on the grid of 1/6-pixel resolution, and 
one side checks the motion vector location of eight pieces in the square of 1 (112). 
When V2 has minimum RD cost (114). an encoder stops retrieval and chooses it as a 
motion vector of a block of V2. Otherwise. V3 is set up in order to express the optimal 
thing in eight last vectors (116). An encoder searchs the new motion vector candidate 
in the square range 1 which has a core in V3 in the grid of 1/6-pixel resolution next 
(118). Some candidates in this grid should be cautious of it being already check ending 
and being able to fly. At this last step, it chooses as a motion vector of a block of a 
candidate with the minimum RD cost (120). 

[0027] According to experimental data, since this easy fast search strategy checks 
RD cost of about 18 typical location in partial (there are more ten pieces than Telenor 
retrieval strategy) pixel space on an average therefore, the complexity of count as the 
whole stops at a moderate increment. 

[0028] It is shown that the experimental result explained below in relation to drawing 8 
thru/or drawing 18 actually has no loss of the compressive ability by using the fast 
search version of this AMA method. That reason is that this fast search AMA retrieval 
strategy is utilizing the degree of convex of a "distortion +Lx bit" curve (it is known 
that "distortion" is convexity) by founding the pass with which RD cost shifts to a low 
smartly from a high level. 

[0029] Another embodiment of this invention changes more than one or it of the step 
108-120. These operation gestalten are also efficient and the number of motion 
vector candidates checked in partial pixel velocity space is reduced further. 
[0030] In the example of drawing 7 . the candidate of 1 / 3 pel precision is checked. In 



this example, step 112 is replaced by one of the three possible scenarios. First, if 
there is optimal motion vector candidate from step 1 10 at the core of VI ("integer pel 
vector") (130), an encoder will check three candidates of 1 / 3 pel precision between 
a main vector, and 1 / 2 pel location which has the minimum RD cost next (132). next 

— if the optimal motion vector candidate from step 1 1 0 is the vector of a corner (1 34) 

— an encoder — ** — 4 vector candidates of 1 / 3 pel precision nearest to a corner 
vector [ like ] are checked (136). If the optimal motion vector candidate from step 110 
is [ 3rd ] between two corner vectors (138). an encoder will determine the one where 
RD cost is lower in two corner vectors, and will check the optimal motion vector 
candidate from step 110, and 4 vector candidates of 1 / 3 pel precision nearest to the 
straight line between corners [ like ] (140). In case there is no V2 in a core, it must be 
between two corners again when there is nothing also by the corner vector, and this 
process is performed, it should be cautious of step 138 becoming unnecessary. When 
setting an encoder as retrieval of the motion vector of 1/3-pixel precision, drawing 7 
can be changed so that it may become the termination instead of continuation to step 
114. 

[0031] (Reduction of computational complexity and memory) At step 108. in order to 
check only the motion vector candidate of 1/2-pixel precision, the computational 
complexity and memory which mounting of hardware and software takes are reduced 
sharply. Especially, in smart mounting of this fast search, a reference frame 
interpolates only 2x2 and obtains RD cost to 1 / 2 pel vector candidate, the fast (or 
cache) memory for hardware or software encoders is remarkable as compared with 
the approach of Telenor required in order that only 3x3 may interpolate a reference 
frame ~ amount reduction is carried out. As compared with the encoder of Telenor, 
cache memory becomes 9/4 of reduction, or 2.25 times as many reduction as this. 
Some additional interpolation can be performed per block later. 

[0032] Furthermore, since the interpolation in step 108 is used so that a search may 
be oriented in the direction which lowers RD cost function value, a composite filter is 
not needed for these interpolation. Therefore, computational complexity is reducible 
by using the easy congruence linearity filter for steps 108. 

[0033] Moreover, since such decision does not have the significant profits by using 
high degree of accuracy, a decision about other main coding like selection of a macro 
block mode (for example, 16x16, four 8x8 grades) can be made using 1 / 2 pel vector. 
Next, since an encoder interpolates a required partial pixel value to some additional 
vector candidate who checks in the remaining step, it can use a complicated cube (3rd 
order) filter. Since a macro block mode is already selection ending, it is necessary to 



carry out interpolation of these last to selection mode. 

[0034] By using two or more filters, it always compares with the approach of Telenor 
using cube interpolation, and is Sparc. Ultra Reduction of the computational 
complexity which exceeds 20% by the operation time in 10 mold workstation was 
obtained. Furthermore, the need for fast memory decreased in abbreviation one half. 
Moreover, about compressive ability, **** was almost without a loss. In a comparison 
with 1 operation gestalt of this fast search, it is only that this invention requires about 
seven interpolation per pixel to Benzler technique needing interpolation of about 70 
per pixel in a Telenor encoder. 

[0035] (Bit coding of a motion vector and precision) If the optimal motion vector and 
precision are determined, an encoder will encode both a motion vector and a precision 
value in a bit. One approach adds some extra bits, in order to encode a motion vector 
in a given precision (for example, half-pixel precision), next to detail the vector for a 
higher motion precision. Although this is the strategy which B.Girod proposed, it is 
suboptimization in respect of rate distortion. 

[0036] With one desirable operation gestalt of this invention, it encodes first using an 
easy sign as showed the precision of the motion vector to one macro block in Table 1. 
It is also possible to use other tables of code length s {1. 2. 2}'s arbitration. A bit rate 
can be further decreased using the typical DPCM method. 
[0037] 
[Table 1] 



[0038] Next, the vector value in each precision space is encoded. These bits can be 
obtained from the item of a single VLC table which is used in an H26L codec. An 
important idea is these bits' changing with motion precision of a macro block, and 
interpreted. For example, when motion precision is 1/3 and the sign bit to X 
component of a different motion vector is 000011. X components of a vector are 
Vx=2/3. If precision is 1/2. X component of a vector is equivalent to Vx=1. 
[0039] The approach of this invention is used for encoding the vector of the motion 
precision of arbitration as compared with the approach of Benzler which encodes a 
motion vector on the variable-length sign (VLC) table which can be used for coding of 



1/2 and a 1/4-pixel precision vector, and in each frame and a macro block, tables 
differ and can be interpreted. Furthermore, it is not necessary to be a multiple 
mutually or. and the overall approach of this invention can be applied to the motion 
precision of arbitration, and does not need to be a 1/n (n is integer) mold. Counting of 
the number of increments in given partial pixel space is carried out simply, and it uses 
the bit in the related item of a table as a sign. From a viewpoint of a decoder, if motion 
precision is decoded, a motion vector can also be decoded easily. Then, the related 
block in a front frame is reconstructed using the cube interpolator of four typical taps. 
A different 4 tap filter for every motion precision exists. Since the number of 
actuation which reconstruction of a prediction block takes an AMA method is the 
same and it is unrelated to motion precision, complexity of decode is not increased. 
[0040] (Experimental result) Drawing 8 thru/or drawing 18 show the result of having 
examined the coding codec of Telenor, without not using AMA or using it by various 
animation sequences, resolution, and frame rates of a publication in Table 2. These 
drawings plot rate distortion ("RD") for every case. The "Anchor" curve shows RD 
point from optimization H.263+ (only drawing 8 and drawing 9 ). Telenor 1 /"2+b" curve 
shows Telenor (classic example) by 1 / 2 pel vector, and congruence linear 
interpolation. The "Telenor 1/3" curve shows the proposal ("Telenor encoder") of the 
present Telenor. The "Telenor+AMA+c" curve shows the Telenor encoder by the full 
search strategy of this invention. The "Telenor+FSAMA+c" curve shown in drawing 
15 thru/or drawing 1 7 shows the Telenor encoder by the fast search strategy of this 
invention. (As long as there was no other convention, the full search version of AMA 
was the strategy of the encoder used for the experiment.) All the test results 
performed the crosscheck in the encoder and the decoder. When these results carry 
out AMA, the gain in the peak ("PSNR") of a signal-to-noise (noise) ratio exceeds 
H26L, is IdB in height, and is still higher than a classic example. 
[0041] 
[Table 2] 



[0042] The animation sequence is used in common by the animation coding 
community except for "shaking Paris." The latter is the synthetic animation obtained 
when only the motion vector to which X and Y component take any value in the range 
of [-1 and 1] moved the sequence "Paris" known well. This synthetic animation 
simulates the small motion produced with the stock camera in the typical animation 
telephone scene. 

[0043] (Comparison of the full search of a motion precision adaptation (AMA) method, 
and a fast search) the experimental result shown in drawing 16 and 17 shows that the 
usability ability of the encoder of a fast search strategy ("Telenor FSAMA+c") and a 
full search strategy (Telenor AMA+c — ") method about AMA is the same in fact. 
Since a fast search strategy uses the convexity of RD cost curve in partial pixel 
velocity space, this is infallible. Since in other words the configuration of RD cost 
curve follows a gently-sloping convex curve, the minimum value can be easily 
discovered using some smart fast search schemes which drop a curve. 
[0044] (Association of AMA and a plurality reference frame) In the graph shown in 
drawing 1 8 , the curve with the indicator of "1 r" used only one reference frame for the 
motion compensation. Therefore, these curves are the same as the curve shown in 
drawing 10 . The curve with the indicator of "5r" used five reference frames. The 
experiment shows that the gain by AMA is added to the curve obtained using two or 
more reference frames. The gain by AMA in the case of one reference frame can be 
measured by comparing the curve (notes: introductory notes x and **) of green and 
pink, and the gain in the case of five reference frames can be measured between the 
curves (notes: introductory notes O and -) of blue and red. 

[0045] Although a frame which especially the thing which should be careful of can 
carry out this invention with a frame level, and is different can use a different motion 
precision, it is using the same precision for all the motion vectors in one frame. As for 
motion vector precision, in this operation gestalt, being only once signal-ized in a 
frame layer is desirable. It is shown that compression gain which was shown per case 
of macro block adaptation also by using the optimal fixed motion precision to all 
frames produces an experiment. 

[0046] In the operation gestalt of other frame bases, an encoder can perform a motion 
compensation on the whole frame in a different vector precision, and can choose the 
optimal precision in accordance with RD criterion after that. Although this approach is 
not suitable for the one pass encoder of a pipelined architecture, it is applicable to the 
encoder of the software base, or more complicated coding. In the operation gestalt of 



other frame bases, in order that an encoder may predict the optimal precision over a 
given frame, a former statistic and/or a former formula can be used (for example, the 
formula described by the paper of Ribas or its version can be used). Although this 
approach depends for engine-performance gain on the precision of the formula used 
for prediction, it is suitable for the one pass encoder. 

[0047] what the vocabulary and expression which were used for the above-mentioned 
specification are a thing for explanation, and is restricted — it is not — ** — in using 
the vocabulary and an expression, there is no intention which eliminates the 
vocabulary and an expression with equivalent description of a display and description 
or its part, and it checks that the range of this invention is specified and limited by 
only the claim of a claim. [ like ] 
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[Brief Description of the Drawings] 

[Drawing 1] It is drawing showing the example of the location of the perfect pel in 
velocity space, and 1/3 pel. 

[Drawing 2] It is a flow chart Fig. for explaining the approach by the conventional 
technique of presuming the optimal motion vector. 

[Drawing 3] It is drawing showing the example of the motion vector candidate location 
at the time of the full search in partial pixel velocity space. 

[Drawing 4] It is a flow chart Fig. for explaining the desirable embodiment of the 
optimal motion vector presumption approach of the full search method by this 



invention. 

[Drawing 5] It is drawing showing the example of the motion vector candidate location 
at the time of the fast search in partial pixel velocity space. 

[Drawing 6] It is a flow chart Fig. for explaining the desirable operation gestalt of the 
optimal motion vector presumption approach of the fast search method by this 
invention. 

[Drawing 7] It is a flow chart Fig. for explaining another desirable operation gestalt of 
step 1 1 4 of drawing 6 . 

[Drawing 8] It is the graphical representation showing the result of having run the 
performance test of a Telenor encoder without having set to the animation sequence 
"a container" and using or using adaptation motion precision (AMA) in the frame rate 
of per second ten frames, and resolution QCIF. 

[Drawing 9] It is the graphical representation showing the result of having run the 
performance test of a Telenor encoder without having set to the animation sequence 
"news" and using or using adaptation motion precision (AMA) in the frame rate of per 
second ten frames, and resolution QCIF. 

[Drawing 10] It is the graphical representation showing the result of having run the 
performance test of a Telenor encoder without having set to the animation sequence 
"an automobile" and using or using adaptation motion precision (AMA) in the frame 
rate of per second ten frames, and resolution QCIF. 

[Drawing 11] It is the graphical representation showing the result of having run the 
performance test of a Telenor encoder without having set to the animation "a garden" 
and using or using adaptation motion precision (AMA) in the frame rate of per second 
15 frames, and resolution SIF. 

[Drawing 12] It is the graphical representation showing the result of having run the 
performance test of a Telenor encoder without having set to the animation sequence 
"a garden" and using or using adaptation motion precision (AMA) in the frame rate of 
per second 15 frames, and resolution QCIF. 

[Drawing 13] It is the graphical representation showing the result of having run the 
performance test of a Telenor encoder without having set to the animation sequence 
"Tempete" and using or using adaptation motion precision (AMA) in the frame rate of 
per second 1 5 frames, and resolution SIF. 

[Drawing 14] It is the graphical representation showing the result of having run the 
performance test of a Telenor encoder without having set to the animation sequence 
"Tempete" and using or using adaptation motion precision (AMA) in the frame rate of 
per second 15 frames, and resolution QCIF. 



[Drawing 15] It is the graphical representation showing the result of having run the 
performance test of a Telenor encoder without having set to the animation sequence 
"shaking Paris" and using or using adaptation motion precision (AMA) in the frame 
rate of per second 15 frames, and resolution QCIF. 

[Drawing 16] In an animation sequence "an automobile", it is the graphical 
representation showing the result of the performance test of a fast search strategy 
("Telenor FSAMA+c") and a full search strategy ("Telenor AMA+c") in the frame rate 
of per second ten frames, and resolution QCIF. 

[Drawing 1 7] It is the graphical representation showing the result of the performance 
test of a fast search strategy ("Telenor FSAMA+c") and a full search strategy 
("Telenor AMA+c") in the frame rate of per second ten frames, and resolution QCIF in 
an animation sequence "a container." 

[Drawing 18] It is the graphical representation showing the performance test result 
which compares the case where it examines using two or more frames the case where 
it examines by resolution QCIF and the frame rate of per second ten frames in an 
animation sequence "an automobile" only using one reference frame for a motion 
compensation, and for a motion compensation. 
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n 7n -y ^7 ^1(Dtf gi^ijMti. t □ 7n -y ^' O 2 i^jc 

[0 0 0 3] {5iJ^a\ Jiv^D7P-y^t«t,-gfr§ 
#M V ^ n 7p >y ^ *'l^-f4H t ^ 5 i>±«SiB 

(0, 0) -e$.§o «fe-a-r§#Mv^'P 

7a -y MT-i^ a 7a -y i'W^ffillA^ f>;&}c 2 H^S 
(2, 3) -C?fe?>o fi^Xo^Jim^^'>h)Ui. 7j<¥^^ 40 

mm "mm^/v ) <Dmm^^ 

■rSiiEdo Elt*3V^T. ^^h^l'V,^ (1, 1) 

[0 0 0 4] i)ia>'->'ttJ<7)i^l(l*7'>*x^ hit. L*^ 

7b-A;!)^P,7U-A'\SIS(iiil|tM5i-#fuT- 

^^m-r^ufuy^ii, Hij7b-A«rNXN<0H? 50 



If fjg 2001-189934 
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(7r^^) T-MWU i^tcMraL/c7Wi.rtco«fe 

-at 5 V ^ p 7p 'y ^' ^-ft-^^r^ C (c i; D IgMT- 

/Nmm<Dm^imm?> c ii^^r-t. i /mm (xa 

[0 0 0 5] IT U«MWI¥fbg|5PlO "H. 2 6 L 
^M{C^^-r?iii^t¥t>'*Ht'\CDlS^ (Response to Call 

for Proposals for H.26L) " (Q. 1 5/S G 1 6, 
^»Q1 5-F-l 1, y7;K 1 9 9 8^^1 \m R 

If I T um-mmmitm^io "h. 2 6 im^Km 

5 T e 1 e n o r <D^M<D^it (Enhancement of theTe 

lenor Proposal for H.26L) " (Q. 1 5/S G 1 6, 
3i:»Q 1 5-G-2 5. hb-, 1 9 9 9^2^) 
{C:J3V^T> Gisle Bjoitegaardti, il 

mn^itm^H. 26L ("Teleno r ^mt 

1 e n o vn^itmit. irnmrnny ^ )\^^^m^^x 

#M7U-i.%3 X 3ffi[a-ri)/0^Xti "7'y7''-9-y7'U 
-rSo CiT^M/^-v^ayti. #JS7U-Aa)9 
fg(0;^^y^*SSi;-r5. HH©^^e.nfc:v^7P7a 

'y^ttfc'l^T. Teleno r ?vF^{b^««5aitlt-^^ 

h;i/^2-3(DXT-y7T-ifi/g-r5a ip-^. '^mtmts 
mmrji\/3mmmm.<D-%'^V)\/Vm ^umt^. m 

1 ©Mfii. 3 X 3ffiK#Ig7U-Al*IcD (16X16 

;I/V./3 = (VX, VY) = (1 + 1/3, 1) {CMS 

-r ^7p -y ^•e35i,gt,-gfr?.7P 7 ^^M-otj-tti 
to CCDTeleno v'^^Wt. '^-Of^nfm^M 
LTl/^So ^fe-f\ l/3'-5;;l'«acDE!]§'^^' h^b^SI+g 
•r5/£J6tg|5^)-«)i^77X h-9--^!iliSi:ffi*t^3i:^ 
f*7-i';l^^^ (^imtcioV^T) f$fflt5ct-e^§o 

{C, CCDT e 1 e n o r??^jl:t5«, l/3iii*}C@^ 

V\ <1©T e 1 e n o r ^^fktl©nr^S?5^ 

("VLC") T-fMt^ \ / smmim-^Ltcms. 

[0 0 0 6] «fea<»enTi^?.i]Hiaii;^riT-«. 
1 / 2ii^afi-eKt-<^ ^;^^^ssLT^f^^b■r§ 

SjStt^ w 1 5 mmm'mmm%mz ^ § /-f 



7 

[0 0 0 7] A^ia^lSmiWif^cD 1 Oi:LT, Ber 
n d G i r o d (DWsIC (J-X^G i r o d(DmiCtVBm 

fs) "'7fm-<)immic^mmmm\ (Motion-con 

pensating Prediction with Fractional-Pel Accurac 
y) " (IEEE mm^m, ^41#4^f, 604- 
6 1 2M, 1 9 9 3^4^) ij^mf^rt^o HCDG i r 

h^o G i r oda. g|5^ia^^P^tc4olt5«)iiJt-^ 

Mzmm-^mn^'?^ ( "mad" ) nmrn^m^^ 
ofccotcsiju^nso G i r o dtt. ^mmm 

[0 0 0 8] ftotM^Oif^i: LT, S m i t a G u 

p t aRVfk lien G e r s h oOffti (Kfi "G 

(On Fractional Pixel Motion Estimation) " (S P 
IE VC I P^ffi, 2 0 9 4#4 0 8-4 1 9M. 

yfV yiy, 1 9 9 3^11^) tmf^tl^. C<DG 
u p t a cO^jScfi. iiiBJ±IS<Dfci6icgP^^iB*«aTiJ 

l/^^o Gupta (Dti3tti, ¥±^SS^M ( "M S 

E " ) ^ Mmmmicm-^ < i>s*p^is l, ^ ©i^i^^ 

n/co onii, mm-^tiTc mmm^i) -n^mmx^ 

[0 0 0 9] "rtiy':7icm-:^<m^mmmmw^ 
imcDm-^o^mmmt^i^h^vmrn (on the optmai m 

otion Vector Accuracy for Block-Based Motion-Compe 
nsated Video Coders) " (IS T/S F 1 E "rU 

^/mm(DEm^m ■ tji^v Xht&m, 302-3 

1 4H. ^y-Ja-il, 1 9 9 6^2^) (Wk. R i b 

a s(DmiCtmit^') Ic^l'X. J o r d i Rib 
a s — C o r b c r a&O'D a V i d L. Neuh 
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*^<7):^S^SI^Lfco t:<7)R i b a s(DmiCit. ffiStffl 

[0 0 1 0] lii "^<0^^'\^y hU~hX(Dmmit 
10 tClB-rSiTLl/^nTll^OSg (Proposal for a new c 
ore experiment on Prediction enhancement at higher 

bitrates) " (ISO/IEC JTC1/SC29 
/W G 1 1 WlMRXS^Pi^^mt. MPEG97/1 8 
2 7, -be 'J 7. 1 9 9 7¥2J^) fit/ "1/4'^;HI) 

(Performance Evaluatior of a Reduced Complexity I 
mplemetatior for Quarter Pel Motion Compensatio 
r) " (ISO/IEC J TC 1/S C 2 9/WG 1 
1 mmRlfB!^(D^^-it, MPEG97/3 1 46, -9" 
20 yi^a-t, 1 9 9 8¥lfi) Kfel/^T. U r i c h B 
e n z 1 6 r fi. i/Piv'-'yyXOfcfefc \ / 
mU(DW]^^'7V)\'(D'^mt. M P E G 4 ijB?^^{bS 

L^^if^, Benzlerti, \ / A^)]/(DW]^^^ V 
)\'<D%mcQ, i r o d<D7 7Xh-9--^i$ffi?:fflt/^fco 

B e n z 1 e rli. S^Sffir^7^';l'^^#liL^A^o 

-F (M;t{f> 16X16, 4-8X8, 16-4X4 

m) ic-D^ I / 4-^jvmmxsmbxmm(D^-h'^^ 

^t^tciblC. tmm.tf'^r^Xh^o B e n z 1 e r{i 
at^tc. Benzler^i. 1 / 2 S.t>' 1 / 4 iS^Wg 

40 ( "V L c" ) y—y)\^^m\'^xm^^'^ 

[0 0 1 1 ] 

Yj\^mm:^mtt^MWsm.^^m\>>x\^ 

[0 0 12] 

50 [^;s*»p^-r5fcJ6«D^©] *56Wioio<D^igsmi 
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[0 0 13] *%B^©!iiiS^fflv>5i;. iftiiit^^Wi 

trntx 3 0 %{cmr^\£y h u-hcomM) ^mmio 
fSB/§timsj#«a ( "AM A" ) yj^tLximr^t 

-So 

[0 0 14] ^^mmmLi^mmnmcDi'Dii^ 

mm (+J-7't!^-t7W sp^^cfc^j-§7 7X^+^-^«l& 

oy>; <y Fi^T\ ?i?2-tr-y h(Dmt^'> h;imm^um 

[0 0 15] ^%mmimiiL\,mmmmjcis\,'r. 

[0 0 16] ^mM(Dm(Dit}tL^^mmBmii. u-h 
(RD) \zmhxwm(D<'!7V}imm^mm 

[0 0 17] *fg0H©$?>fCgiJCD»^U!.^IISSms[±. 

m)]m^mmt (vlo rstcj;!?. ftt-^^h;u 
[0 0 18] ^mm±}&Rxiimm. ^WLRm^iM. 

[0 0 19] 

^ic:isi^xmmm^mLxmmt^t)\ mm^^ty 

-'^r L Tlil£X« "7 b- AfttCgM-r Stf^tc 

Ltc^^iz. T e 1 e n o r mmmim (Rm^ 

iatTc I enor(Dn^itm) ^mttit^^<DtLX 
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IB^-rSc Telenor ©S)iiif9^{b^©ffl|g%Ml/^ 

[0020] mm^^im<D^<it. mm (xa " i 

§0 Telenor ©fff-^fbts©^ 1 1 

T e 1 e n o r O|vF^{t:f§c0«ff/^-v^~3 
?P):&Sffiffi^iJI#^?f?.;'ci6tci/3^;b-^^ 

10 y^WB^mm^^iHrntxi^'^o mc. i-^cD#^e.n 

f=V^ayD>y^'^C^5^^T, T e 1 e n o r (D^mm 

fi. El2(c^-r 2ocDXr'y7°-e«ji;5:t(it^^ 

«^^5o ^■r. ii^T e 1 e n o T^mtmii. mm 

*SiS^;l/<^F;l^V, (01) ^j^?gf5 (Xr^y^ 

100)0 i^t. lICDT e 1 e n o r ^?^^kg5^i^ Vi 
ffffi®*!^ 1 / 3^;l/)BS^^ h;l/V,/:i (01)^ 
•9--f-r§ Ur-yT'l 0 2) o cco^ZXT-yfim 

1 (D^^77t:^-rcj: 9 3 X 3 rt^ mm) pmy u 

-ArtO 16X1 60l«?iJ%Wr?>) :i:&(2-e8 

20 miDyo^y^^m^xm^-mt^yu-y^^^-Difta 

o^"U-y Kft'{C8fiO||^ST*7SbT$.?.c Ell (cfcV^ 

T. i^t-a-r?.®a. m^^'^ hJi'Wu, = (v«, v 
0 = (1 + 1/3, 1) icmmt^7tiy^x$>^o 

[0 0 2 1] *^W(DS*l5JcJ:n{f. ?SF^{kg§{i. yjl 

fisoB $^a©ffi (j?ij ^ ii\ 1 / 2 , 1 / 3 so? 1 

[0 0 2 2] (7;l/-9--f- A M A 7?S<D-9--f-«cB§) 0 

30 3RXimnc^stJ;^dlc. C(Dy)l^-^'MlZMtW& 

(AM A) 731*^-9— ^isKScOii^. Tel enor^n 

i:LTStt^n§) "jETiB^m" ti/emmmmA 

mH^TT^t^oic, 7;l/-9--f-AMA05g 1 Xf--y7° 
(10 4) T-ii:a}i%Sa^;l/'-N^ 1-71/Vi (01) ^ 
■^-=f-t^o 7;l/-it- f'AMA<^|g2Xr-y7' (1 0 
6) {CfcV^T. t?^{bt§WV,#jS<D«3g^l/6Bi^ 

40 ftfi--^^ F;bv,/6 (03) ^-y— f-r?)o i^wfn 

if. C07;l/-y— f-AMAt±. Telenor 7°n-fe7. 
2 XX -y T'^^H LT^^ftglA^JIS^^I^Offl© 

i.ol^Lrc^(DXh^o ^<DBmi. ^'"U -y Frt^DRii 

gp-&. iiv^pyu'y^icet-iscrs 
:/D>y^ (ffira#M7U'-Art) ^fg^-rs-^^' F;i/^ 

f§m-r§c tT-$,§o i:co7;i/+f-f-lf5S«. i20fa 

50 [0 0 2 3] mt^^ F^lz-y— f-(C*5ttSfflS^SI5M 
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(oyimi. ^i^^mmm cms e) xa¥i^*g?^M5> 
(MAD) (Di^m^aom^sm^m^ri^^^o zocoy 

^m(OMADmit. a^B+»c»f^t)D{i:iii*iill(D*6 

cDiiifi7n'y^A^SVHc|imT*n(i\ MS ERlfMA 

^^itmii. «/JncOMS EXfia/hOMAD(Df5in:6^% 

M S E XfiM A D{C*3l^Tf/f4iDV^' n7P >y 
[0 0 2 4] ^i.'t^-^^e. MS ES:t>*MAD©^*a!ia 

[0 0 2 5] ncorsm^mmt^rMc. T e l e n o 

x(d--yh" fl^oL— hfi^;^ (RD) |iJ5ga?P^ffifflLT 
(CtiMS EX«MADT-^i3. "L" imm\y^fl (IP 

tflfflT-t^o L*^L75:*'iP.. *fgH^£D«^. "tr>y 
h" ti. '-^^ h;l/£D??F^fl:^cg1-§t:'-y h 
;U«JgcotfF^f|;{cS-r-Sii'-y ^*#^•^■l.^§o «l 

i!(tD "id--y y m^i^m^o mm. m (1/2, 

-1/2) {Cft§<l^ffi(i, l/2X(±l/6iii*|ia 
[0 0 2 6] (7 77.h-9— f-AMA7?S-9— ^KSS) 

s (A MA) ij^v--9mio^'^. ^mtmm^^ 

^AMACD^I Xx^yy (10 8) Tii. |vf^fkt§ti. 

1 / 2 iii^lllfi® o y u -y F *3 1/ ^ T V . fp;L,^;£ 
-mt^ I ©lEM^ (lE^^ff^lSH 1 P^) ©8f@©l!)t^^ 

h ;bfgffi>&^ X -y ^ t" ^ o «/J^^D R D n X h ^ Wr 5 
ffi (EP-^^ SficDm^^' Vi Ocf'T-aii'Dt®) 
^SS^-T'l./caitcv.^.^jg-r^ (1 1 0) o :-Xlc. ^ 
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^x-y^'f^ (1 1 2) o VzA^ft/hRDnXhJ&^L 

TV^^^^ (114). l?F^ft:#§a«^;&cl3±L, V. 

i6icv3^mmt^ (1 1 6) „ ^mtmi. Max 

J^^H liftoff Li^Kt-^^ h;l/Mi<i%-<t-^t§ (1 
1 8) 0 CCD^^^U-y FrtCDilO*^<r)^MaffifC^x>y^7 

^^tiy^<Dm^<^V)itLxmmr^ d 20) o 

[0 0 2 7] ^^•r-^?fCj;na\ CC0®*:^77XF 
•y— ^IMti. ¥±^1? (T e 1 e n o r S^lBgi; 1 
Offl^v^) g|i^^iii^SP4ti:t5tj-Sf-'cSW*l>) 1 8©fuM 
«RDaXF^f-x'y^-r?iCDT\ fifoT. ^{*i:LT 

[0 0 2 8] g|8>n,Sill StMilbTUTtClfiB^f 5 
*i^^*(±. ii©AMA;^^<D7rXh-9--f-/<?-v'a 

A M A W.W^^f)\ RTiU7.VtS^m^^)\'t-^^^V^)\' 
tc X V- F ^ff -r § ^ ^ X ^lO^f 5 C t J: o T 
^i+Lxe-y F" mi(Dih]& ( "itj^i" (iiattT-^Se 

[0 0 2 9] *%^(DglJ(D|IS6^m^ Xr-y/l 0 8 

- 1 2 ocD^-^cD 1 -:)Xfi^ni:^±^SM-r§o ens 

-So 

[0 0 3 0] 07 m\X^i. 1 / 3 -^^HiS^fSM^f^ 
x-y^'f^o CCDlliSifi^iJiCfcV^T. XT-^yT"! 1 2tt3 

f--y7°l 1 0*>6tD«3i:&|flt'^^' FMl^liAW 

. ( "fiigf^;l/^^ F;i/" ) coff'LNtc^nif (i 3 

ffi^9=-x-y^-r§ (1 3 2) „ '-klC^ Xf-'y7°l 1 0;^)^ 
50«ja>5:ij^'^^ F;H8ffi*"!n— t-cD^^ F;l/-r-fe 
ntf (1 3 4) . ^^fkf|{i*^<};d*3-:^-^^F;I/ 
icfttjfil/^ 1 /3^)imA(r)4^^ F;W8ffi^f-x'y^ 
■rs (13 6). mSliC. X-r-yT"! 1 0*>e.©«jii5: 

if (1 3 8) . mmmii2-ocDa~i—^'> h)l(D<^ 
XRD^iXhm&l^y^^^'jtb. XT-y7°l 1 OA^SO 
Sji^Sit-^^ F;MSIii;A^<t-5S3— t-iicDf^coil 

^t^ (1 4 0) o V2t'^^'bl,cm<X:^-i--^'y h 

;i/-efe»t/^ti^i±2-3©n— ;^-f^l^c^.§ll■^-$,t)^ c 
^yn-tX^SIff-r^l^tcxr-y:/! 3 Sli^iBmcfj: 
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[003 1] mmmt^^v(Dmm xi^y/i o s 

Xh^-^(DX-r-hrj:mmc^^''r. mM7\y-L.I,i 

2X2 tt«F^ L r 1 / 2 -<.)i-< ^ h Mmmc Mt 5 R 
mm77Xh mi^^y-y:x) ^^va^ ^m7u 

-L.^3x3mmmt^rcibl>cmt^T e l e n o r 
miiti:mLXf:'rj:>:)(Dmm'mn^. le l eno 
r (D^^itm timt. + + <y i U « 9 / 4 ©iJ 

[0 0 3 2] ^Sfc, XT^-yn 0 8(c*5it§MKlJ:R 

D F mmm^T^f^yimc^-f-^yj\ti-:sif:^^o 

[0 0 3 3] X. v^n7P-y^^-F (^Jxtf. 16 
X 1 6 , 4 «<D 8 X 8 (?)ji^«<j; d ^ffi£D±S;5:?fF 
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[0 0 3 4] w^7^)v^-^mt^it\z^K). nn 
irn^mm^my^^^T e i e n o T<oi5mt]mL. s 

pare Ultra 1 OSy^-^Xx-i/H ^T-cT) 

X F-9— f-cD 1 SSSm^TcDi:fce?t*5V>T. B e n z I 
10 e rSati. T e 1 e n o r ^^^McfeV^TB^afc 
0^7 0(7)MS?r*;>Si:-rS(^JcWU *^B^{iiii*^ 

[0 0 3 5] m^^^ V)itm&(D^y v^^t) m 

Ant^o iinii. B. Gi r 0 d;0"i?S^L/clli)ST-SS 

20 Ubs u-hm^(OMxii^'?fmmitx^^. 
[0036] :^mM<D 1 -D(Diff^ n^^nmnmxii. i 

(1, 2, 2} cDffiMWfficDT— 
i:feRlfigT'fe5„ lf«yFU-F«, f^^W^&DPCMa 

[0 0 3 7] 
[«1] 



0 1 



1 1 



1/2-:/!/ 



*1. Jr*<OVi^n:/oy^rKSti-a|»ar^-KS.^^lij|gj| (VLC) 



[0 0 3 8] i^t. ^f^nm&'^fmr.iin^^^ VMU 
t^J^,^Ti^ffl^n§ J; 9 V L c f—zr^KDmsi. 

h « D 7d ^y ^ (J^Kl tafitc J; o TS^o T»fj^^ 40 

n^^txh^o »)^«a;()U/3-egD, s 

F^l/«OXfi!ci)-{c5t^-r:5^^tr>y Vt!^0 0 0 

0 1 IT-feSif^. F;L'©X)jic5jHiVx = 2/3T 

= 1 {cfflsrso 

[0 0 3 9] l/2S:U"l/4ia^ffia-^^F;l/CDt?-^ 
fttfffflT-t^nr^ftffF^ (VLC) T-7MZ^y)m 
^^^Y)\y^^mtt^^ enz 1 e v (Dy^xktimL 



ST-feo/cD 1/n (niilfO iy-efe§£^s^i%^^o 
~y)i'(DWMm.E^(^^yv^'^^tLx^m-t^. m 

A<43©Ma7'P 'y 4 ^? -y ^(DiLyjmmm 

^m^^'xnmmt^o »mmmmicm^4^'y7°7 

[0 0 4 0] (»S«) m 8 7^)50 1 8 g 2 {C^2 
tDftHi^-'^-yX. ^^astf 7 I^-A U~ F 
T^AMA^r^^fflXfi^^MJ'^-ftT e 1 e n o r(D??^-{l: 
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mi. ^■y-xmicu-hm^ ( "rd" ) ^imirc 

^(D-Qh^o "Anchor" fflUfi, ftJiftH. 2 6 3 + 
(08 tBigcOi'*) *^P,^DRD;^^^LTV^5o "Te 
lenor 1 / 2 + b " ftlgti 1 / 2 h;l/i: 
WmMmiC^:^! e. lenor (SftW^ftW 
TV^-So "Te I e n o r 1/3" fti^tt, STe 1 e 
n o v(D^m ( "T e 1 e n o r^mm" ) ^fnLX 
l/^^o "Te 1 e no r+AMA+c" MBt. *l80i§ 
(Dy)V^-=^-'mmcX^'Y e 1 e n o v'^mtm^^^L 
Tl^^c m \ 57^S01 yfC^^-r "T e 1 e n o r + F * 
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* SAMA + c" *%B^(7)7rXh-9--^islll§t 

i;5T e 1 e n o r tvF^fbf§^/T^ UTV^^o (ffifCJI^ 
/^(^'^l/^PSD. AMAC07;l'-y--f-/^->>'3 ytiHi^t^ 

AMA^iisa-r§«^. fs^^^st (/ 

YX) ( "PSNR" ) (Cfctj-?.?lJ|#(4, H 

2 6 £;g:ffl^ 1 d B (D^tVh *) . S*W=S:0iJ cfc D 5 

10 [0 0 4 1] 
[^2] 













H 8 


QCI F 


1 0 


=3.— ;^ 


B 9 


QC I F 


1 0 




Eai 0 


QC I F 


1 0 




mi 1 


S I F 


1 5 


«■ 


i Sfl 2 


QC I F 


1 5 


Temp e t e 


Bl 3 


S I F 


1 5 




Bl 4 


QC I F 


1 5 




HI 5 


QC I F 


1 0 



«2. mm.<f>mm 



Co 0 4 2] illifiiv-'^ryxJi, "t0tl-&/^';" 

^'Tiff- [- 1 , 1 ] tDlSH7-ftM(7)fiI;§i:?,i]t-^i' h 

CO 0 4 3] (i&tHjaiilS (AMA) 73ie07;U-tl-- 30 
^Ry'7 7Xh-9--^Oltfii) 01 6Stfl 7{c^L/-c 
^i^*SIRa, AMAfc|g-r§7rxh-9--^|jeSS ( "t 
elenor F SAMA+c"),!; 7;l/-9--5'KBS 

(Telenor A M A + c ") :77^cOt?^ftf§«f^ 

ffltti6*^lll^{c«iii;T'fe5i:i!*^LTv>-5o 7rx 

T\ ^«g/jNfil(i. !a,fS^m^•y:■S.I|•^*^£DXv-^ 
^7rx h■9--^X4^-A;g:ffll/^T§B^cfgM■e#?.„ 40 
CO 0 4 4] (AMASy||^j(#Bg7U-AOlSa-) 0 

1 8^c/^■r^"^7^c^3l^T. "i r" o^i^^&feoflis 

il^ffi{ficD/ci6(i: 1 ^^0#H§7lx-A0*i%ffiffl t 

fc. 'i^t-DX. iin^^ffilSfiai otC/TLfcfflHiiBli; 
T-3o5o " 5 r " 5 «J©#B87 U- 

i.^^ffl L/co A M A t j; 5fiJf#[i:?Il^tD#{'§7 b-A 

#M7l^-A lft(D^^<r)AMA(cj;§fiJf3(i, 
illfy^cDiaiS (a : ilMT'X tA) ^tmt^ctic 



Co 0 4 5] mcjiimt^^ct^i.. *fgS^(i71^-A 

iy-^)ixnmx^. ^rs^yu-A^m^ji^mmm^ 

f^i^^ctti^x^^t^ \yu-L.n<D^xcom^^'^ 
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1 . Title of Invention 

METHODS FOR MOTION ESTIMATION WITH 
AD APTTVE MOTION ACCURACY 

2 . Claims 



I . A fest-scaich adaptive motion accuracy search method for estimating 
motion vectors in motion-compensated video coding by finding a best motion vector for a 
macroblock, said method comprising the steps of: 

(a) searching a first set of motion vector candidates in a grid of sub-pixel 
resolution of a predetermined square radius centered on V, to find a 
best motion vector 

(b) searching a second set of motion vector candidates in a grid of sub- 
pixel resolution of a predetermined square xadius centered on to 
find a best motion vector V3; and 

(c) searching a third set of motion vector candidates in a grid of sub-pixel 
resolution of a prcdetennincd square radius centered on V3 to find 
said best motion vector of said macroblock. 

2. The method of claim I, said step of searching a first set of motion 
vector candidates in a grid of sub-pixel resolution of a predetermmed square radius centered 
on V, to find a best motion vector V, further comprising the step of scaichmg a first set of 
eight motion vector candidates in a grid of 1/2-pixel resolution of square radius 1 centered 
on Vi to find a best motion vector Vj. 

3. The method of claim 1, said step of searching a second set of motion 
vector candidates in a grid of sub-pixel resolution of a predetermined square radius centered 
on to find a best motion vector V3 further comprising the step of searching a second set 
of eight motion vector candidates m a grid of 1/6-pixd resolution of square radius 1 centered 
on V2 to find a best motion vector V^. 
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4. The method of claim 1 further comprising the steps of using V2 as the 
motion vector for the block if has the smallest rate-distortion cost and skipping step (c) 
of claim 1. 

5. The method of claim 1, said step of searching a third set of motion 
vector candidates in a grid of sub-pixel resolution of a predetermined square radius centered 
on V^i to find said best motion vector of said macroblock further comprising the step of 
searching a third set of eight motion vector candidates in a grid of 1/6-pixel resolution of 
square radius 1 cientered on V3 to find said best motion vector of said macroblock. 

6. The method of claim 1 , said step of searching a third set of motion 
vector candidates in a grid of sub-pixel resolution of a predetermined square radius centered 
on V3 tQ find said best motion vector of said macroblock further comprising the step of 
slapping motion vector candidates of said third set of motion vector candidates that have 
already been tested. 

7. The method of claim 1 finther wherein said step of searching said first 
set of motion vector candidates further comprises the step of searching said first set of 
motion vector candidates using a first filter to do a first interpolation, said step of searching 
said second set of motion vector candidates fiirther comprises the step of searching said 
second set of motion vector candidates using a second filter to do a second interpolation, and 
said step of searching said third set of motion vector candidates farther comprise the step 
of searching said third set of motion vector candidates using a third filter to do a third 
interpolation. 

8. The method of claim 1, said step of searching a second set of motion 
vector candidates in a grid of sub-pbcel resolutioii of a predetermined square radius centered 
on to find a best motion vector V3 further comprising the steps of: 

(a) searching three candidates of l/3ipel accuracy Vj and a 1/2-pel 
location wifli the next lowest RD cost if is at the center; 

(b) searching four vector candidates of ITB-pel accuracy that are closest 
to V2 if Vj is a comer vector; and 
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(c) determining which of two comers has lower RD cost and aeaiching 
four vector candidates of 1/3-pel accuracy that arc closest to a line 
hetween said comer wifli lower RD cost, if is bctwecai two comers 
vectors. 

9, An ad^tive motion accuraiqr search method for estimating motion 
vectors in motion-compensated video coding by finding a best motion vector for a 
macroblock, said method comprising the steps of: 

(a) searching a first set of motion vector candidates in a grid centered on 
V, to find a best motion vector Vj using a first filter to do a first 
interpolation; 

(b) searching a second set of motion vector candidates in a grid centered 
on Vj lo find a best motion vector Vj using a second filter to do a 
second interpolation; and 

(c) searching a third set of motion vector candidates in a grid centered on 
V3 to find said best motion vector of said macroblock using a third 
filter to do a third interpolation* 

10. The method of claim 9 wherein said step of searching using a first 
filter to do a first mteipolation fiirther comprises using a simple filter to do a coarse 
interpolation. 

11. The method of claim 9 wherein said step of searching using a first 
filter to do a first interpolation flirther comprises using a simple filter to do a coarse 
interpolation and said step of searching using a second filtesr to do a second interpolation 
further comprises using a complex filter to do a fine interpolation. 
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12. The mediod of claim 1 1 wherein said step of searching using a third 
filter to do a third interpolation further comprises using a complex filter to do a fine 
interpolation. 

13. The method of claim 9 wherein said step of searching using a first 
filter to do a first interpolation farther comprises using a bilinear filter to interpolate the 
reference Jfirame by 2x2. 

14. The method of claim 9 wherein said step of searching using a first 
filter to do a first' interpolation fiirthcr comprises using a bilinear filter to mtcipolatc the 
reference firame by 2x2 and said step of searching using a second filter to do a second 
interpolation fiirthcr comprises using a cubic filter to do a fine interpolation. 

15. The method of claim 14 wherein said step of seardhing using a third 
filter to do a third interpolation fiuther comprises using a cubic filter to do a fine 
interpolation. 

16. An adaptive motion accuracy search method for estimating motion 
vectors in motion-compensated video coding by finding a best motion vector for a 
macroblock, said method comprising the steps of: 

(a) searching at a first motion accuracy for a first best motion vector of 
said macroblock; 

(b) encoding said first best motion vector and said first motion accuracy; 

(c) searching for at least one second best motion vector of said 
macroblock at an at least one second motion accuracy; 

(d) encoding said at least one second best motion vector and said at least 
one second motion accurac)^ and 

(e) selecting the best motion vector of said first and at least one best 
motion vectors using rate-distortion criteria. 
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17. The method of claim 1 6 whereiii said step of selecting the best motion 
vector using rate-distortion criteria fiirfher comprises the step of said rate-distortion criteria 
adapting according to the different motion accuracies to detennine both the best motion 
vectors and the best motion accuracies. 

18. The method of claim 1 6, said stqp of searching for at least one second 
best motion vector at an at least one second motion accuracy further comprising the step of 
searching for at least one second best motion vector of said macroblock at an at least one 
second motion accuracy that is finer than said first motion accuracy. 

19. The mefliod of claim 16 wherein said step of selecting the best motion 
vector using rate-distortion criteria further comprises the step of using rate-distortion criteria 
of the type "distortion H- L*Bits" to select the best motion vector. 

20. An adaptive motion accuracy search method for estimating motion 
vectors in motion-compensated video coding by finding a best motion vector for a 
macroblock, said method comprising the steps of: 

(a) searching at a motion accuracy for a best motion vector of said 
macroblock; 

(b) caicoding said motion accuracy using a code from a VLC table that is 
intcxprcted differently at different coding units according to the 
associated motion vector accuracy; and 

(c) encoding said best motion vector in the respective accuracy space. 

21 . A system &r estimating motion yectois in motion-compensated video 
coding by fiading a best motion vector for a macroblock, said system comprising: 
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(a) a first encoder for searching a first set of motion vector candidates in 
a grid of sub-pixel resolution of a predctcnnincd square radius 
centered on V, to find a best motion vector Vj; 

(b) a second encoder for searching a second set of motion vector 
candidates in a grid of sub-pixel resohition of a predetennined square 
radius centered on Vj to find a best motion vector V,; and 

(c) a third encoder for searching a third set of motion vector candidates 
in a grid of sub-pixel resolution of a predetermined square radius 
centered on to find said best motion vector of said macroblock. 

22, Thesystemofclaim21 wherein said first, second, and tturd encoders 
are a single encoder. 

. Detailed Description of Invention 

BACKGROUMD OF THE INVENTION 

The present invention relates generaUy to a method of compressing or 
coding digital video with bits and, specifically, to an effective method for estimating and 
encoding motion vectors in motion-compensated video coding. 

In classical motion estimation the current fi*ame to be encoded is 
decomposed into image blocks of flic same size, typically blocks of 16x16 pixels, called 
"macroblocks/* For each current macroblock, the encoder searches for the block in a 
previously encoded fimae (Ihe "reference fiame**) that best matches the current 
macroblock. The coordinate shift between a current macroblock and its best match in the 
reference fi^ic is represented by a two-dimensional vector (the "motion vector") of fee 
macrobloclc Each component of the motion vector is measured in pixel units. 

For example, if the best matdi for a current macroblock happens to be at 
the same location, as is the typical case in stationaiy background, the motion vector for 
the current macroblock is (0,0). . If the best match is found two pixels to the right and 
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three pixels up from the coordinates of the current macroblock, the motion vector is (2,3)* 
Such motion vectors are said to have integer pixel (or **integer-pel" or "full-per') 
accnracy^ since fheir horizontal X and vertical Y components are integer pixel values. In 
FIG. 1, the vector Vj represents the full-pel motion vector for a given current 
macroblock. 

Moving objects in a video scene do not move in integer pixel increments 
from frame to frame. True motion can take any real value along the X and Y directions. 
Consequently, a better match for a current macroblock can often be found by 
mterpolating the previous frame by a factor NxN and then sean±ing for the best match in 
the interpolated frame. The motion vectors can then take values in increments of 1/N 
pixel along X and Y and are said to have 1/N pixel (or " 1/N-pel") accuracy. 

In "Response to Call for Proposals for H,26L," ITU-Telecommunications 
Standardization Sector, Q.15/SG16, doc. Q15-F-1 1, Seoul, Nov. 98, and "Enhancement 
of the Telenor proposal for H,26L;* ITU-Telecommunications Standardization Sector, 
Q.15/SG16, doc. Q15-G-25, Monterey, Feb. 99, Gisle Bjontegaard proposed using 1/3- 
pel accurate motion vectors and cubic-like interpolation for the H26L video coding 
standard (the "Telenor encoder"). To do this, the Telenor encoder interpolates or **up- 
samples" the reference frame by 3x3 using a cubic-like interpolation filter. This 
interpolated version requires nine times more memory than the reference frame. At a 
given macroblock, the Telenor encoder estimates the best motion vector in two steps: the 
encoder first searches for the best integer-pel vector and then the Telenor encoder 
searches for the best 1/3-pixel accurate vector Vyj near V,: Using FIG. 1 as an example, 
a total of eight blocks (of 16x16 pixels) in the 3x3 interpolated reference frame are 
checked to find the best match which, as shown is the block associated to the motion 
vectorV,^ = (VX, VY) -(1+1/3,1). The Telenor encoder has several problems. First,it 
uses a sub-optimal fast-search strategy and a complex cubic filter (at all stages) to 
compute the 1/3-pel accurate motion vectors. As a result, the computed motion vectors 
are not optimal and the memory and computation requiremmts are very expensive. 
Furtiber, the Telenor encoder uses an accuracy of the effective rate-distortion criteria that 
is fixed at 1/3-pixel and, therefore, does not adapt to select better motion accuracies. 
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Similarly, the Telenor encoder variable-length code ("VLC") table has an accmacy fixed 
at 1/3-pixcl and, therefore, is not ad^ed and interpreted diBferently for different 
accuracies. 

Most known video con^ression methods estimate and encode motion 
vectors with 1/2-pixel accuracy, because early studies suggested that higher or adaptive 
motion accuracies would increase computational complexity without providing additional 
compression gains. These early studies, however, did not estimate the motion vectors 
using optimized rate-distortion criteria, did not exploit the convexity properties of such 
criteria to reduce computational complexity, and did not use effective strategies to encode 
the motion vectors and their accuracies. 

One such early study was Bemd Girod's "Motion-Compensating 
Prediction with Fractional-Pel Accuracy," IEEE Transactions on Communications. 
Vol. 41, No. 4. pp. 604-612. April 1993 (the "Gixod work"). Tlie Giiod work is the first 
fundamental analysis on the benefits of using sub-pixel motion accuracy for video 
coding. Girod used a simple, hierarchical strategy to search for the best motion vector in 
sub-pixel space. He also used simple mean absolute difference ("MAD") criteria to select 
the best motion vector for a given accuracy. The best accuracy was selected usmg a 
formula that is not useful in practice since it is based on idealized assumptions, is very 
complex, and restricts all motion vectors to have the same accuracy within a fiame. 
Fmally, Girod focused only on prediction enor energy and did not address how to use 
bits to encode the motioii vectors. 

Another early study was Smita Gupta's and Allen Gersho's "On Fractional 
Pixel Motion Estimation," Proc. SPIE VCIP, Vol. 2094, pp. 408-419, Cambridge, 
November 1993 (the "Gupta work"). The Gupta work presented a method for computmg. 
selecting, and encoding motion vectors with sub-pixel accuracy for video compression. 
The Gi^ta work disclosed a fijnnula based on mean squared error ("MSB") and bilinear 
interpolation, used this formula to find an ideal motion vector, and then quantized such 
vector to the desired motion accuracy. The best motion vector for a given accuracy was 
found using the sub-optimal MSB criteria and fbo best accuracy was selected usmg the 
largest decrease in difference energy p«a: distortion bit, which is a greedy (sub-optimal) 
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criteria. A given motion vector was coded by fiist encoding that vector with 1/2-pel 

accuracyandthenencodingthehigheraccuracywithrefinementbits. CouBe-to-fine 
coding tends to require significant bit overhead. 

In "On the Optimal Motion Vector Accuracy for Block-Based Motion- 
Compensated Video Coders," Proc. IST/SPIE Digital Video Compression: Algorithms 
and Technologies, pp. 302-314, San Jose. February 1996 (the "Ribaa work"). Joxdi RiW 
Corbera and David L. NeuhofF. modeled the effect of motion accuracy on bit rate and 
proposed several methods to estimate the optimal accuracies that minimize bit rate. The 
Ribas work set forth a fuU-search approach for computing motion vectors for agivcn 
accuracy and considered only bilinear inteipolation. The best motion vector was found 
by minimizing MSE and the best accuracy was selected using some formulas derived 
from a rate-distortion optimization. The motion vectors and accuiacies were encoded 
with frame-adaptive entropy coders, which are complex to implement in real-time 
applications. 

In "Proposal for a new core experiment on prediction enhancement at 
higher bitratcs." ISO/IEC JTC1/SC29/WG1 1 Coding of Moving Pictures and Audio. 
MPEG 97/1827. Sevilla, Feb. 1997 and "Performance Evaluation of a Reduced 
Complexity Implementation for Quarter Pel Motion Compensation," ISO/IEC 
JTC1/SC29/WG11 Coding of Moving Pictures and Audio, MPEG 97/3146, San Jose, 
Jan. 1 998, Ulrich Benzler proposed using 1/4-pel accurate motion vectors for the video 
sequence and more advanced interpolation filters for the MPEG4 video coding standard. 
Benzler, however, used the Guod's fast-search technique to find the 1/4-pel motion 
vectors. Benzler did consider different mterpolation filtere, but proposed a complex filter 
at the first stage and a simpler filter at the second stage and interpolated one macroblock 
at a time. This approach does not require much cache memory, but it is computationally 
expensive because of its complexity and because all motion vectors are computed with 
1/4-pel accuracy for aU the possible modes in a macroblock (e.g„ 1 6x16, four-SxS, 
sixteen-4x4, etc.) and then the best mode is determined. Benzler used the MAD criteria 
to find the best motion vector which was fixed to 1/4-pel accuracy for the whole 
sequence, and hence he did not address how to select the best motion accuracy. FinaUy, 
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Benzler encoded the motion vectors with a variable-length code ("VLC") table that could 
be used for encoding 1/2 and 1/4 pixel accurate vectors. 

The references discussed above do not estiDiate the motion vectors using 
optimized rate-distortion criteria and do not exploit the convexity properties of such 
criteria to reduce computational complexity. Further, these references do not use 
effective strategies to encode motion vectors and their accuracies. 

BRIEF SUMMARY OF THE INVENTION 

One preferred cmbodnncnt of the present invention addresses the 
problems of the prior art by computing motion vectors of high pixel accuracy (also 
denoted as "fractional" or "sub-pixel" accuracy) with a minor increase in computation. 

Experiments have demonstrated that, by using the search strategy of the 
present invention, a video encoder can achieve significant compression gains (e.g., up to 
thirty percent in bit rate savings over the classical choices of motion accm-acy) using 
similar levels of computation. Since the motion accuracies are adaptively computed and 
selected, the present invention may be described as adaptive motion accm-acy ("AMA"). 

One preferred embodiment of the present invention uses fast-search 
strategies in sub-pbcel space that smartly searches for the best motion vectors. This 
technique estimates motion vectors in motion-compensated video coding by finding a 
best motion vector for a macroblock. The first step is searching a first set of motion 
vector candidates in a grid of sub-pixel resolution of a predetmnined square radius 
centered on V, to find a best motion vector V2. Next, a second set of motion vector 
candidates in a grid of sub-pixel resolution of a predetermined square radius centered on 
V2 is searched to find a best motion vector V3. Then, a third set of motion vector 
candidates in a grid of siib-pixel resolution of a predetermined square radius centered on 
V3 is searched to find the best motion vector of the macroblock. 

In an alternate preferred embodiment the present invention a technique for 
estimating high-accurate motion vectors may use different interpolation filters at different 
stages in order to reduce computational complexity. 

Another alternate prefenred embodiment of the present invention selects 
the best vectors and accuracies in a rate-distortion ("RD") sense. This embodiment uses 
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rate-distortion criteria that adapts according to the different motion accuracies to 

determine both the best motion vectors and the best motion accuracies. 

Still further, another alternate preferred embodiment of the present 

invention encodes the motion vector and accuracies with an effective VLC approach. 
This technique uses a VLC table that is interpreted differaitly at difforent coding units, 

according to the associated motion vector accuracy. 

The foregoing and other objectives, features, and advantages of the 

invention will be more readily understood upon consideration of the following detailed 

description of the invention, taken in conjunction with the acconapanying drawings. 

DETAILED DESCRIPTION OF THE INVENTION 

The methods of the present invention are described herein in temis of the 
motion accuracy being modified at each image block. These methods, however, may be 
applied when the accuracy is fixed for the whole sequence or modified on a frame-by-frame 
basis. The present uivention is also described as using Telaior's video encoders (and 
particularly the TeletKjr encoder) as described in Ifae Background of the Invention, Although 
described in terms of Telenor's video encoders, the techniques described herein are 
applicable to any other motion-compensated video coder. 

Most video codo^ use motion vectors with half pixel (or '•1/2-per') accuracy 
and bilinear interpolation. The fbrst version of Telenor's encoder also used l/2-pe2 motion 
vectors and bilinear interpolation. The latest version of Telenor's encoder, however, 
incorporated 1/3-pel vectors and cubic-like interpolation because of the additional 
compression gains. Specifically, at a given macroblock, Telenor's estimates tfie best motion 
vector in two steps shown in FIG. 2, First the Telenor aicoder scardics for the best integer- 
pel va^tor V| (FIG. I) 100. Second, the Telenor encoder searches for the best l/B-pixcl 
accurate vector Vi,3 (FIG. 1) near Vi 102. This second step is shown gr^hically in FIG. 1 
where a total of eight blocks (each having an array of 16x1 6 pixels) in the 3x3 interpolated 
reference frame are checked to find the best match. The motion vectors for these eight 
blocks are r^resented by the ei^t solid dots in the grid centered on V,. In FIG. 1 the best 
match is the block associated to the motion vector V,^ = (Vx, Vy) = (1+1/3, 1). 

The technology of the present invention allows the encoder to choose 
between any set of motion accuracies (for example, 1/2, 1/3, and 1/6-pel accurate motion 
vectors) using either a full search strategy or a fast search strategy. 

11 
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Full-Search AMA Seardi Strategy 

As shown in FIGS. 3 and 4, in the full-search adaptive motion accuracy 
("AMA") search strategy the encoder searches all the motion vector candidates in a grid 
of 1/6-pixel resolution and a "square radius^ (defined herein as a square hlock defined by 
a number of pixels up, a number of pixels down, and a niunber of pixels to both sides) of 
five pixels as shown in FIG. 3 . FIG. 4 shows that the first step of the full-search AMA is 
to search for the best int^er-pel vector Vi (FIG. 1) 104, In the second step of the full- 
search AMA, the encoder searches for the best 1/6-pixeI accurate vector Yy^ (FIG. 3) near 
Vi 106. In other words, the fiill-search AMA modifies the second step of the Telenor's 
process so that the encoder also searches for motion vector candidates in other sub-pixel 
locations in the velocity space. The objective is to find the best motion vector in the grid, 
i.e., the vector that points to the block (in the interpolated reference firame) that best 
matches the current macroblock. Although the full-search strategy is computationally 
complex since it searches 120 sub-pixel candidates, it shows the full potential of this 
preferred method of the present invention. 

A critical issue in the motion vector search is the choice of a measure or 
criterion for establishing which block is the best match for the given macroblock. In 
practice, most methods use either the mean squared error ("MSE") or mean absolute 
difference ("MAD") criteria. The MSE between two blocks consists of subtracting the 
pixel values of the two blocks, squaring the pixel differences, and then taking the 
average. The MAD difference between two blocks is a similar distortion measure, except 
that the absolute value of the pixel differences is computed instead of the squares. If two 
image blocks are similar to each other, the MSE and MAD values will be small. If, 
however, the image blocks are dissimilar, these values will be large. Hence, typical video 
coders find the best match for a macroblock by selecting the motion vector that produces 
either the smallest MSE or the smallest MAD. In other words, the block associated to the 
best motion vector is the one closest to flie given macroblock in an MSE or MAD sense. 

IMbrtunatcly, the MSE and MAD distortion measures do not take into 
account the cost in bits of actually encoding the vector. For example, a given motion 
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vector may minimize the MSE, but it may be very costly to encode with bits, so it may 
not be the best choice firom an coding staadpoint. 

To deal with this, advanced encoders such as those described by Telenor 
use rate-distortion ("RD") criteria of the type "distortion + L*Bits" la select the best 
motion vector. The value of "distortion" is typically the MSE or MAD, "L" is a constant 
that depends on the compression level (i.e., the quantization step size), and "Bits'* is the 
number of bits required to code the motion vector, hi general, any RD criteria of this 
type would work with the present invention- However, in the present invention "Bits" 
include the bits needed for encoding the vector and those for encoding the accuracy of the 
vector. In fact, some candidates can have several "Bits" values, because they can have 
several accuracy modes. For example, the candidate at location (1/2, -1/2) can be thought 
of having 1/2 or 1/6 pixel accuracy. 

Fast-Search AMA Search Strategy 

As shown in FIGS, 5 and 6, in the fast-search adaptive motion accuracy 
("AMA") search strategy the encoder checks only a small set of the motion vector 
candidates. In the first step of the fast-search AMA, the encoder checks the eight motion 
vector candidates in a grid of 1/2-pixel resolution of square radius 1, which is centered on 
Vi 108. is then set to denote the candidate that has the smallest RD cost (i.e,, the best of 
the eight previous vectors and V,) 1 10. Next, the encoder checks the eight motion vector 
locations in a grid of 1/6-pixcl resolution of square radius 1 that is now centered on V^j ^ 
If V2 has the smallest RD cost 1 14, the encoder stops its search and selects V2 as the motion 
vector for the block. Otherwise, V3 is set to denote the best motion vector of the eight 1 16, 
The encoder then searches for a new motion vector candidates in the grid of 1/6-pixel 
resolution of square radius 1 that is centered on V3 118. It should be noted that some of the 
candidates in diis grid have already been tested and can be skipped. The candidate with the 
smallest RD cost in this last step is selected as the motion vector for the block 120. 

Experimental data has shown that, on average, this simple &st search strategy 
typically checis the RD cost of about eighteen locations in sub-pixel space (ten more than 
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Telenor's search strategy), and hence the overall computational complexity is only 
moderately increased. 

The experimental data discussed below in connection with FIGS. 8-1 8 show 
that there is practically no loss in compression pei&nnancc from using this fast-search 
version of AMA. This is because the fast-search AMA search strategy ejqsloits the convexity 
of the "distortion + L*Bits" curve (c.f , "distortion" is known to be convex), by creating a 
path that smartly follows the KD cost from higher to lower levels. 

Alternate embodiments of the mvention replace one or more of the steps 108- 
120. These embodiments have also been effective and have further reduced the number of 
motion vector candidates to check in the sub-pixel velocity space. 

FIG. 7, for example, checks candidates of 1/3-pel accuracy. In this 
embodiment step 112 is replaced by one of three possible scenarios. First, if the best motion 
vector candidate from step 1 10 is at the center of V, (the "integer-pel vector") 130, then the 
encoder checks three candidates of 1/3-pel accuracy between the center vector and the 1/2- 
pel location with the next lowest RD cost 132. Second, if the best motion vector candidate 
from step 1 10 is a comer vector 134, then, the encoder checks the four vector candidates of 
1/3 -pel accuracy that are closest to such comer 136. Third, if the best motion vector 
candidate from step 1 10 is between two comers 138, then, the encoder determines which of 
these two comers has lower RD cost and checks the foxir vector candidates of 1/3-pel 
accuracy that are closest to the line between such comer and the best candidate from step 110 
140. It should be noted that in implementing this process step 138 may be unnecessary 
because if V2 is neither at the center or a comer vector, then it would necessarily be between 
two comers. If the encoder is set to find motion vectors with 1/3-pixcl accuracy, FIG. 7 
could be modified to end rather than continuing with step 114. 

Computation And Memory Savings 

Because step 108 checks only motion vector candidates of 1/2-pixcl 
accuracy, the computation and memory requirements far the hardware or software 
implementation are significantly reduced. To be specific, in a smart implementation 
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embodiment of this j&st-search the reference fi-ame is interpolated by 2x2 in order to obtain 
the RD costs for the 1/2-pel vector candidates. A significant amount of fast (or cache) 
memory &sr a hardware or software encoder is saved as compared to Telenor's ^iproach that 
needed to interpolate the reference frame by 3x3 » In CQn^}arison to the Telenor encoder, this 
is a cache memory savings of 9/4 or a factor of 2.25. The few additional interpolations can 
be done later on a block-by-block basis. 

Additionally, since the interpolations in st^ 1 08 are used to direct the search 
towards the lower values of the RD cost function, a complex filter is not needed for these 
interpolations. Accordingly, computation power may be saved by using a simple bilinear 
filter for step 108. 

Also, other key coding decisions such as selecting the mode of amacroblock 
(e.g., 16x16, four-8x8, etc.) can be done using the 1/2-peI vectors because such decisions do 
not benefit significantly firom using higher accxuracies. Then, the encoder can use a more 
complex cubic filter to interpolate the required sub-pixel values for the few additional vector 
candidates to check in the remaining steps. Since the macroblock mode has already been 
chosen, these final interpolations only need to be done for the chosen mode. 

Use of multiple-filters obtained computation savings of over twenty 
percent in running time on a Sparc Ultra 10 Woikstation in comparison to Telenor's 
approach, which uses a cubic interpolation all the time. Additionally, the fast-memory 
requirements were reduced by nearly half. Also, there was little or no loss in 
compression performance. Comparing one preferred embodiment of the fest-search, 
Benzlcr's technique requires about 70 interpolatioiis per pixel in the Telenor encoder and 
the present invention requires only about 7 interpolations per pixel. 

Coding The Motion Vector And Accuracies With Bits 

Once the best motion vector and accuracy arc determined, the mcoder 
encodes both the motion vector and accuracy values with bits. One approach is to encode 
the motion vector with a given accuracy (e.g., half-pixel accuracy) and then add some extra 
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bits for refinii^g the vector to the highor motion accuracy. This is &e strategy suggested by 
B . Girod, but it is sub-optimal in a rate-distortion sense. 

In one pre&ired embodiment of the present invention, the accuracy of the 
motian vector for a macroblock is first encoded using a simple code such as the one given 
ta Table 1 . Any other table with code lengths {1, 2, 2} could be used as well. The bit rate 
could be further reduced using a typical DPCM approach. 



Code 


Motion 

Accuracy 


I 


1/2-pel 


01 


1/3-pel 


11 


1/6^1 



Table 1. VLC table to indicate the accuracy 
mode for a given macroblock. 
Next, the value of the vector/s in the respective accuracy space is encoded. These bits can 
be obtained from entries of a single VLC table such as the one used in the H26L codec - The 
key idea is that these bits are interpreted differently depending on the motion accuracy for 
the macroblock. For example, if the motion accuracy is 1/3 and the code bits for the X 
component of the difference motion vector are 00001 1, the X component of the vector is 
Vx= 2/3. If the accuracy is 1/2, such code corresponds to Vx= 1 , 

Compared to the Benzler method for encoding the motion vectors with a 
variable length code C'VLC") table that could be used for encoding 1/2 and 1/4 pixel 
accurate vectors, the method of the present invention can be used for encoding vectors of 
any motion accuracy and the table can be interpreted differently at each frame and 
macroblock* Further, the general method of the present invention can be used for any 
motion accuracy, not necessarily those that are multiples of each other or those that are of 
the type l/n (with n an integer). The number of increments in the given sub-pixel space is 
simply coimted and the bits in the associated entry of the table is used as the code. 
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From the decoder's viewpoint, once the motion accuracy is decoded, the 
motion vector can also be easily decoded After that, the associated block in the previous 
firame is reconstructed using a typical 4-t^ cubic interpolator. There is a different 4-tap 
filt^ for each motion accuracy. 

The AMA does not increase decoding complexity, because the number of 
operations needed to reconstruct the predicted block are the same, regardless of the 
motion accuracy. 

Experimental Results 

FIGS. 8-18 show test results of the Telenor encoder codec with and without 
AMA in a variety of video sequences, resolutions, and frame rates, as described in Table 2. 
These figures show rate-distortion ("RD") plots for each case. The "Anchor" curve shows 
RD points from optimized H.2634 (FIGS. 8 and 9 only). The "Telenor ITZ+b" curve shows 
Telenor with 1/2-pcl vectors and bilinear interpolation (the "classical case")- The "Telenor 
1/3" curve shows the current Telenor proposal (the "Telenor encoder"). The 
"Telenor+AMA+c'' curve shows the Telenor encoder with the full-search strategy of the 
present invention. The " Telenor +FSAMA+c", as shown in FIGS. 15-17, shows the current 
Telenor encoder with the fast-search strategy. (Unless otherwise specified, the full-search 
version of AMA was the encoder strategy used in the experiments.) All of the test results 
were cross-checked at the encoder and decoder. These results show that with AMA the gains 
in peak signal-to-noise ratio ("PSNR") can be as high as 1 dB over H26L, and even higher 
over the classical case. 



Video sequence 


FIG.# 


Resolution 


Frame rate 


Container 


FIG. 8 


QCIF 


10 


News 


na9 


QCBF 


10 


Mobile 


FIG. 10 


QCIF 


10 
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FIG. 11 


SEF 


15 


Garden 


FIG. 12 


QCIF 


15 


Tempete 


FIG. 13 


SIF 


15 




no. 14 


QCIF 


15 


Paris Shated, 


no. 15 


QCIF 


10 



Table 2. Description of the Experiments 
The video sequences are commonly used by the video coding community, 
exc^t for •'Paris Shaked." The latter is a synthetic sequence obtained by shlfKng the well- 
known sequence "Taris" by a motion vector whose X and Y components take a random value 
within [-1^1]. This synthetic sequence simulates small movements caused by a hand-held 
camera in a typical video phone scene. 

Comparison Of Full-Search And Fast-Search AMA 

The experimental results shown in FIGS. 16 and 17 demonstrate that the 
encoder performance with fast-search ("Telenor FSAMA+c") and full-search ('•Telenor 
AMA-f c"] strategies for AMA is practically the same. This is true because the fast-search 
strategies exploit the convexity of the KD cost curve in the sub-pixel velocity space. In other 
words, since the shape of the RD cost follows a smooth convex curve, its mmimum should 
be easy to find with some smart fast-search schemes that descend down the curve. 

Combining AMA And Multiple Reference Frames 

In the plot shown in FIG. 1 8, the curves labeled " Ir" xised only one reference 
ftamc for the motion compcnsatioUi so these curves are the same as those presented in FIG. 
10. The curves labeled **5r" used five reference frames. 

The e3q)erimcnt5 show that the gains with AMA add to fliose obtained using 
multiple reference frames. The gam from AMA in the one-reference case can be measured 
by comparing the green and pink curves, and the gain in the five-reference case can be 
measured between the blue and red curves. 
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It should be noted that the present invention may be implemented at the 
frame level so that difSsrcat frames could use different motion accuracies, but within a firiame 
all motion vectors would use the same accuracy. Preferably in this embodiment the motion 
vector accuracy would then be signaled only once at the frame layer. Experiments have 
shown that using the best, fixed motion accuracy for the whole frame should also produce 
compression gains as those presented here for the macroblock-adaptive case. 

In another frame-based embodiment the encoder could do motion 
compensation on the entire frame with the different vector accuracies and then select the best 
accuracy according to the ED criteria. This approach is not suitable for pipeline, one-pass 
encoders, but it could be appropriate for software-based or more complex encoders. Still 
another fame-based embodiment the encoder could use previous statistics and/or formulas 
to predict what will be the best accuracy for a given frame (e.g., the formulas in set forth in 
the Ribas work or a variation thereof can be used). This approach would be well-suited for 
one-pass encoders, although the performance gains would depend on the precision of the 
formulas used for the prediction. 

The terms and expressions which have been employed in the foregoing 
specification arc used therein as terms of description and not of limitation, and there is no 
intention, in the use of such terms and expressions, of excludbig equivalents of the 
features shown and described or portions thereof, it being recognized that the scope of the 
invention is defined and limited only by the claims that follow. 

4. Brief Description of Drawings 

FIG. 1 is a diagram of an exemplary full-pel and 1/3-pel locations in 

velocity space. 

FIG. 2 is a flowchart illustrating a prior art method for estimating the best 

motion vector. 

FIG. 3 is a diagram of an exemplary location of motion vector candidates 
for full-search in sub-pixel velocity space. 
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FIG. 4 is a flowchart illustrating a full-search prefmred embodiment of the 
method for estimating the best motion vector of the present inventioa. 

FIG. 5 is a diagram of an exemplary location of motion vector candidates 
for fast-search in sub-pixel velocity space* 

FIG. 6 is a flowchart illustrating a fest-search prefexred embodiment of the 
method for estimatiTig the best motion vector of the present invention. 

FIG. 7 is a detail flowchart illustrating an alternate preferred embodiment 
of step 114 of FIG. 6. 

FIG. 8 is a graphical representation of experimental perfiamance results of 
the Telenor encoder with and without AMA in the "Container** video sequence, with 
QCIF resolution, and at the frame rate of 10 frames per second* 

FIG. 9 is a graphical representation of experimental perfonnance results of 
the Telenor encoder with and without AMA in the "News" video sequence, with QCIF 
resolution, and at the frame rate of 10 frames per second, 

FIG. 10 is a graphical representation of experimental performance results 
of the Telenor encoder with and without AMA in the "Mobile" video sequence, with 
QCEF resolution, and at the frame rate of 10 frames per second. 

FIG. 1 1 is a graphical representation of experimental perfonnance results 
of the Telenor encoder with and without AMA in the "Garden" video sequmce, with SIF 
resolution, and at the frame rate of 15 frames per second. 

FIG. 12 is a graphical representation of experimental performance results 
of the Telenor encoder with and without AMA in the "Garden" video sequence, with 
QCIF resolution, and at the frame rate of 15 frames per second. 

FIG. 13 is a graphical representation of experimental perfbimance results 
of the Telenor encoder with and without AMA in the 'Tempete" video sequence, vnth 
SIP resolution, and at the frame rate of 15 frames per second. 

FIG. 14 is a graphical representation of experimental perfonnance results 
of the Telenor encoder with and without AMA in the "Tcmpctc" video sequence, vrith 
QCIF resolution, and at the frame rate of 15 fiiunes per second. * 

FIG, 15 is a graphical representation of experimental performance results 
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of the Telenor encoder with and without AMA in the "Paris shaked'* video sequence^ with 
QCIF resolution, and at die frame rate of 10 frames per second* . 

FIG. 16 is a graphical representation of experimental performance results 
of fast-search ("Telenor FSAMA+c") and full-search ("Telenor AMA+c") strategies in 
the "Mobile" video sequence, with QCIF resolution, and at the frame rate of 10 frames 
persecond- 

FIG. 17 is a graphical representation of experimental perfiDamance results 
of fast-search ('Telenor FSAMA+c") and full-search ("Telenor AMA+c") strategies in 
the "Container" video sequroce, with QCIF resolution, and at the franie rate of 10 frames 
per second. 

FIG, 1 8 is a graphical representation of experimental performance results 
of tests using only one reference frame for motion coropensation as compared to tests 
using multiple reference firames for motion compensation the in the **Mobile" video 
sequence, with QCIF resolution, and at the frame rate of 10 frames per second. 
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The encoder searches 
for the best integer-pel vector V1 . 


100 












The encoder searches for 
the best 1/3-pixel accurate 
vector Viy3 near V1 . 
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FIG.3 
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FIG.4 






The encoder searches 
for the best integer-pel vector VI. 
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The encoder searches for 
the best 1/6-pixel accurate 
vector VI /6 near V1 . 
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FIG.5 
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FIG.6 



Checking the eight motion vector candidates in a grid of 
1/2-pixel resolution of radius 1 which is centered on VI. 
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Denoting the candidate that has the smallest RD cost as V2. 
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Checking the eight motion vector locations in a grid of 
1/6-pixel resolution of radius 1 that is now centered on \i2. 
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Denoting the best moton vector of the eight as VS. >^ 
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Checking tiie new motion vector candkJates in 

the grid of 1/6-plxel resolution ofradius 1 that is 
centered on V3. Observe that some of the candidates in 
the grid have already been tested and can be skipped. 



h 

118 



Selecting the candidate with the smallest 
RD cost as the motion vector for the block. 
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Checking three candidates 
of 1/3-peI accuracy between the 
center vector and the 1/2-pel 
location with the next lowest RD cost. 
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Checking the four vector 
candidates of 1/3-pei accuracy 
that are closest to such comer. 
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Determining which of the two comers 

has lower RD cost and checking 
the four vector candidates of 1/3-pei 
accuracy that are closet to the line 
between such comer and V2. 
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FIG.17 
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1 . Abstract 

Methods for motion estimation with adaptive motion accuracy of the 
present invention include several techniques for computing motion vectors of high pixel 
accuracy with a minor increase in computation, One technique uses fast-search strategies 
in sub-pixel space that smartly searches for the best motion vectors, An altcmate 
technique estimates high-accurate motion vectors using different interpolation filters at 
different stages in order to reduce computational complexity. Yet another technique uses 
rate-distortion criteria that adapts according to the different motion accuracies to 
determine both the best motion vectors and the best motion accuracies. Still another 
technique uses a VLC table that is interpreted differently at different coding units, 
according to the associated motion vector acciuracy. 



2 , Representative Drawing 
Fig. 6 



